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(57) This invention relates to a protein that is a Zy- 
gfomycefes-derived endoglucanase lacl<ing the cellu- 
lose-binding domain and exhibits endoglucanase activ- 
ity, and a method for using the same. This invention can 
enhance effects of an endoglucanase enzyme on fabric 
treatment such as reduction of fuzzing, Improvement in 



feel and appearance, color clarification, partial color 
change, and softening of cellulose-containing fabrics 
and on perfomnance improvement in the deinking of 
waste paper and drainage of paper pulp. 
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Description 

Technical Field 

5 [0001 ] The present invention relates to an endoglucanase enzyme, the cellulose-binding domain of which has been 
deleted, with enhanced effects in the treatment of cellulose-containing fabrics and with applications regarding deter- 
gents or paper pulp, a method for producing the same, and a cellutase preparation with enhanced effects. 

Background Art 

10 

[0002] Treatment of cellulose-containing fabrics with cellulase is can-ied out to provide the fabrics with desired prop- 
erties. For example, treatment with cellulase is carried out in the fabric industry in order to improve the feel and ap- 
pearance of cellulose-containing fabrics or to give colored cellulose-containing fabrics an appearance of "stone- 
washed" material, i.e., partial color change (European Patent No. 307,564). 
15 [0003] Colored cellulose-containing fabrics are known to become fuzzy after repeated washings and to lose their 
vividness. Incorporation of cellulase into a detergent can remove fuzz and make the color of fabrics vivid, i.e., clarify 
the color (European Patent No. 220,016). Thus, detergents containing cellulase are commercially available mainly in 
Europe and America. 

[0004] In the aforementioned application, cellulases derived from Trichoderma or Humicola (both are wood-rotting 

20 fungi) are mainly used. Recently used cellulase preparations are produced by isolating endoglucanases, which are 
highly active in fabric treatment, from these cellulase components and enhancing their effects with genetic engineering 
in order to improve commercial efficiency. Examples of these highly active endoglucanases include: Humicola insolens- 
derived EG V (WO 91/17243) and NCE4 (WO 98/03640) that strongly act on cotton fabrics; and Rhizopus oryzae- 
derived ROE I, RCE II, and RCE ill, Mucor circineHoides-demed MCE I and IVICE II, and Phycomyces niters-derived 

25 PCE I (WO 00/24879) that strongly act on lyocell fabrics. 

[0005] Among endoglucanases used in the aforementioned applications, EG V (WO 91/17243), NCE4 (WO 
98/03640), and RCE I, RCE II, RCE III, MCE I, MCE II. and PCE I (WO 00/24879) are presumed to belong to the same 
family (family 45) because of their amino acid sequences, and these enzymes have common structural properties. 
Specifically, each of these endoglucanases comprises a cellulose-binding domain for binding to cellulose as its sub- 

30 strate (hereinafter referred to as "CBD") , a catalytic active domain as an active center (hereinafter refen'ed to as "CAD") , 
and a linker domain with a high hydrophilic amino acid residue content for linking these two domains. 
[0006] EG V, an endoglucanase belonging to family 45, was studied using an enzyme, the CBD domain of which 
had been deleted (JP Patent Publication (PCT Translation) No. 9-500667, Enzyme and Microbial Technology, 27(2000), 
325-329). However, no improvement has been reported regarding the activity of endoglucanase for removing fuzz from 

35 cellulose fabrics through the deletion of the cellulose-binding domain (CBD). There are still many unclarified matters 
concerning the role of the cellulose-binding domain of endoglucanase in the exhibition of endoglucanase activity, and 
research thereof is limited to cellulase derived from a specific fungus, namely, Trichoderma (Kiovula, A. et al., Tnchode- 
rma Gfiociadium, 2, (1 998), 3-23). There has been no detailed research on the cellulose-binding domain of the Zygo- 
mycetes-derived endoglucanase, 

40 [0007] Up to the present, several contrivances have been made in order to improve the effect or perfomriance of 
cellulase in the above applications. For example, mutation was applied to an enzyme for improvement thereof, or 
culture conditions were modified in order to improve the productivity of the enzyme. Due to the high cost of cellulase 
to be used, however, the effect of cellulase should be further improved in order to provide a cellulase preparation that 
is worth using at an industrially practical level. Recently used cellulase preparations are produced by reinforcing only 

45 endoglucanase, which is highly active in fabric treatment, with genetic engineering in order to improve commercial 
efficiency. Accordingly, it is desirable to further improve the activity of such highly active cellulase. 

Disclosure of the Invention 

so [0008] An object of the present invention is to provide an endoglucanase with improved activity, a cellulase prepa- 
ration comprising the same, and various methods for treating cellulose-containing fabrics using the same. 
[0009] The present inventors have conducted concentrated studies on the Zygomycetes-6er\yed endoglucanase 
concerning the role of a cellulose-binding domain when acting on cellulose-containing fabrics. As a result, they have 
found that an endoglucanase lacking the cellulose-binding domain has much higher activity of removing fuzz from 

55 cotton, lyocell, or the like compared to an endoglucanase having a cellulose-binding domain. This led to the completion 
of the present Invention. 

[0010] Specifically, the present invention relates to a Zygomycetes-der'\\fe6 endoglucanase that has enhanced effects 
for removing fuzz from cellulose-containing fabrics (e.g., an enzyme comprising amino acid sequences of RCE I, RCE 
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II, RCE til, MCE I, MCE II, and PCE I, which attained enhanced effects of removing fuzz from cellulose-containing 
fabrics through the deletion of the cellulose-binding domain, and exhibiting endoglucanase activity, a modified protein 
thereof exhibiting endoglucanase activity, or a homologue of the protein or the modified protein) and a cellulase prep- 
aration comprising such endoglucanase. The present invention also relates to an endoglucanase that was produced 
in a host cell transformed with a gene encoding such an endoglucanases, and further relates to a method for treating 
cellulose-containing fabrics using the endoglucanase, which attained Improved activity through the deletion of the 
cellulose-binding domain, or the cellulase preparation. 
[0011] More specifically, the present invention includes the following. 

(1) A protein that Is a Zygomycetes-6erWe6 endoglucanase lacking the cellulose-binding domain and exhibits 
endoglucanase activity. 

(2) A protein that is a Zygomycetes-derived endoglucanase belonging to family 45 lacldng the cellulose-binding 
domain and exhibits endoglucanase activity. 

(3) The protein according to (1) or (2), wherein the Zygomycetes are microorganisms selected from the group 
consisting of those belonging to Hhlzopus, Mucor, and Phycomyces. 

(4) The protein according to (3), wherein the Zygomycetes are microorganisms belonging to Rhizopus. 

(5) A protein comprising an amino acid sequence as shown in SEQ ID NO: 1 , 3, 5, 7, 9, or 1 1 , wherein the cellulose- 
binding domain has been deleted therefrom, and exhibiting endoglucanase activity, a modified protein thereof 
exhibiting endoglucanase activity, or a homologue of the protein or the modified protein exhibiting endoglucanase 
activity. 

(6) A protein comprising an amino acid sequence as shown In SEQ ID NO: 1 , 3, 5, 7, 9, or 1 1 , wherein the cellulose- 
binding domain has been deleted therefrom, and exhibiting endoglucanase activity. 

(7) A gene encoding the protein, modified protein thereof, or homologue of the protein or the modified protein 
according to any one of (1 ) to (6). 

(8) An expression vector comprising the gene according to (7). 

(9) A host cell transfonned with the expression vector according to (8). 

(10) The host cell according to (9), which Is a filamentous fungus. 

(11 ) The host cell according to (1 0), which Is a microorganism belonging to Humicda. 

(1 2) A method for producing a protein comprising steps of culturing the host cell according to any one of (9) to (11 ) 
and collecting from the host cell obtained by the step of culturing or its culture product the protein, modified protein 
thereof, or homologue of the protein or the modified protein according to any one of (1) to (6). 

(13) A protein produced by the method according to (12). 

(14) A cellulase preparation comprising the protein, modified protein thereof, or homologue of the protein or the 
modified protein according to any one of (1) to (6) and (13). 

(15) A method for treating cellulose-containing fabrics comprising a step of bringing cellulose-containing fabrics 
into contact with the protein, modified protein thereof, or homologue of the protein or the modified protein according 
to any one of (1) to (6) and (13) or the cellulase preparation according to (14). 

(16) A method for reducing the rate at which cellulose-containing fabrics become fuzzy or reducing fuzzing in 
cellulose-containing fabrics comprising a step of bringing cellulose-containing fabrics into contact with the protein, 
modified protein thereof, or homologue of the protein or the modified protein according to any one of (1 ) to (6) and 
(13) or the cellulase preparation according to (14). 

(1 7) A method of weight loss treatment of cellulose-containing fabrics to improve the feel and appearance thereof 
comprising a step of bringing cellulose-containing fabrics into contact with the protein, modified protein thereof, or 
homologue of the protein or the modified protein according to any one of (1) to (6) and (13) or the cellulase prep- 
aration according to (14). 

(18) A method of color clarification of colored cellulose-containing fabrics comprising a step of treating colored 
cellulose-containing fabrics with the protein, modified protein thereof, or homologue of the protein or the modified 
protein according to any one of (1) to (6) and (13) or the cellulase preparation according to (14). 

(19) A method of providing colored cellulose-containing fabrics with partial color change comprising a step of 
treating colored cellulose-containing fabrics with the protein, modified protein thereof, or homologue of the protein 
or the modified protein according to any one of (1) to (6) and (13) or the cellulase preparation according to (14). 

(20) A method for reducing the rate at which cellulose-containing fabrics become stiff or reducing stiffness in cel- 
lulose-containing fabrics comprising a step of treating cellulose-containing fabrics with the protein, modified protein 
thereof, or homologue of the protein or the modified protein according to any one of (1) to (6) and (13) or the 
cellulase preparation according to (14). 

(21) The method according to any one of (15) to (20), wherein fabrics are treated through soaking, washing, or 
rinsing thereof. 

(22) An additive to detergent comprising the protein, modified protein thereof, or homologue of the protein or the 
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modified protein according to any one of (1) to (6) and (13) or the cellulase preparation according to (14) in a non- 
dusting granular fomn or a stabilized liquid fonn. 

(23) A detergent connposition comprising the protein, modified protein thereof, or homologue of the protein or the 
modified protein according to any one of (1) to (6) and (13) or the cellulase preparation according to (14). 
5 (24) A method of deinking waste paper using a deinl<ing aagent wherein the protein, modified protein thereof, or 

homologue of the protein or the modified protein according to any one of (1) to (6) and (13) or the cellulase prep- 
aration according to (14) is used in a step of deinking waste paper. 

(25) A method for improving drainage of paper pulp comprising a step of treating paper pulp with the protein, 
modified protein thereof, or homologue of the protein or the modified protein according to any one of (1) to (6) and 

10 (13) or the cellulase preparation according to (14). 

(26) A method for improving digestibility of animal feeds comprising a step of treating animal feeds with the protein, 
modified protein thereof, or homologue of the protein or the modified protein according to any one of (1) to (6) and 
(13) or the cellulase preparation according to (14). 

IS 1 ■ Endoglucanase lacking cellulose-binding domain 

(001 2] The present Invention relates to a protein that Is a Zygomycefes-de rived endoglucanase lacking the cellulose- 
binding domain and exhibits endoglucanase activity. 

[0013] In this description, "Zypomycefes" refer to microorganisms belonging to Zygomycota, i.e., fungi that generate 

20 zygospores through gametangial copulation upon gametogony. The Zygomycota includes Zygomycetes and Tri- 
chomycetes. The Zygomycetes used in the present invention are not particularly limited. IVIicroorganlsms belonging to 
the Zygomycetes are preferable, those belonging to the Mucomtes are more preferable, those belonging to Rhizopus, 
Mucor, or Phycomyces are still more preferable, and those belonging to the Rhizopus are the most preferable. 
[0014] In this description, "endoglucanase activity" refers to CMCase activity. Further, "CMCase activity" refers to 

25 an activity for hydrolyzing carboxymethylcellulose (CMC, Tokyo Kasei Kogyo, Japan), and one unit is defined as the 
amount of an enzyme which produces reducing sugars corresponding to 1 \irr\o\ of glucose per minute by measuring 
amounts of the reducing sugars released after incubation of a test protein with a CMC solution for a given period of time. 
[0015] Endoglucanase activity can be detemnined by, for example, a procedure as described below. At the outset, 
0.5 ml of a solution containing a test protein Is added to 0.5 ml of 50 mM acette acid-sodium acetate buffer solution 

30 (pH 6.0) with 2% CMC dissolved therein, and the mixture Is subjected to incubation at SO^'C for 30 minutes. Subse- 
quently, the concentration of the reducing sugars produced In the resulting reaction solution is quantified by the3,5-din- 
Itrosallcyllc acid (DNS) method. Specifically, 3.0 ml of a DNS reagent is added to 1 .0 ml of the reaction solution 30 
minutes after the reaction, and the mixture is subjected to incubation in a boiling water bath for 5 minutes. Thereafter, 
the incubation product is diluted with 8.0 ml of distilled water, and the absorbance at 540 nm is measured. A calibration 

35 curve is prepared using a gradually diluted glucose solution, and the amount of the reducing sugars produced in the 
enzyme reaction solution Is determined. Activity is determined using an amount of the enzyme that produces reducing 
sugars con-esponding to 1 jimol glucose per minute as one unit. This DNS reagent can be prepared in accordance 
with publication such as Seibutsu Kagaku Jikkenhou 1 - Kangentou no Teliyouhou (Biochemical Experimentation 1- 
The method for quantifying reducing sugar) (p. 19-20, Sakuzo Fukui, Center for Academic Publications Japan), and 

40 can be prepared in the manner described below. At the outset, 880 ml of 1% 3,5-dinitrosalicylic acid solution and 255 
g of Rochelle salt are added to 300 ml of an aqueous solution of 4.5% sodium hydroxide (solution A). Separately, 10 
g of crystalline phenol is added to 22 ml of an aqueous solution of 10% sodium hydroxide, and water Is further added 
and dissolved in the mixture to bring the amount thereof to 1 00 ml (solution B). Sodium bicarbonate (6.9 g) is added 
to 69 ml of solution B and dissolved therein, solution A is poured therein, and the mixture is stirred and mixed until the 

45 Rochelle salt is thoroughly dissolved. The mixture is allowed to stand for 2 days and then filtered. 

[0016] The term "endoglucanase" used herein refers to an enzyme exhibiting endoglucanase activity, i.e., endo- 
1 ,4-j3-glucanase (EC 3. 2. 1 . 4). This enzyme hydrolyzes the p-1 ,4-glucopyranosyl bond of p-1 ,4-glucan. 
[0017] Endoglucanases are classified into several families based on information such as their amino acid sequences. 
The endoglucanase according to the present invention may belong to any family, and it preferably belongs to family 

50 45. The endoglucanase "belonging to family 45" refers to those types having a consensus sequence, (Ser, Thr, or Ala) 
-Thr-Arg-Tyr-(Trp. Tyr, or Phe)-Asp-Xaa-Xaa-Xaa-Xaa-Xaa-(Cys or Ala), in the catalytic active domain (CAD). Humicola 
insolens-derived EG V (JP Patent Publication (PCX Translation) No. 5-509223), NCE4 (WO 98/03640), and the like 
also belong to family 45. 

[001 8] A protein of endoglucanase belonging to family 45 is comprised a catalytic active domain (CAD), a cellulose- 
55 binding domain (CBD), and a linker domain for linking based on function. The cellulose-binding domain (CBD) is known 
to exist as a domain linking tocellulose as its name suggests, and the conservation of the following consensus sequence 
Is conflmned as a feature of the sequence (Hoffren, A. -M. et al.. Protein Engineering 8: 443-450, 1995). 
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CBD consensus sequence : 

5 1 10 

Xaa Xaa Xaa Xaa Xaa Xaa Gin Cys Gly Gly Xaa Xaa Xaa Xaa 
20 

Gly Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys XaaXaa 
30 

XaaXaaXaaAsnXaaXaaiyrXaaQnC^ Xaa(SEQE)NO: 17) 

[0019] In this sequence, Xaa is independently any amino acid; and Xaas at positions 20, 21 , 22, 23, 24, 30 and 31 
nnay be Independently absent. Other Xaas are always present and are independently any amino acids. Amino acids 
other than Xaa are expressed in three-letter abbreviations. CBD is linked to either the N-terminal side or C temninal 

20 side of CAD through a linker domain. Also reported is Humicola insolens-derived NCE5 (amino acid sequence: SEQ 
ID NO; 38, cDNA sequence: SEQ ID NO. 39) such as a family 45 endoglucanase that does not originally have CBD. 
[0020] Although there is no definite recognition sequence for the linker domain, the sequence is rich in hydrophilic 
amino acid residues such as Ser or Thr, and its length varies depending on types of endoglucanases. 
[0021] Examples of the Zygomycetes-6er\ye6 endoglucanase according to the present invention include enzymes 

2S exhibiting endoglucanase activity derived from Rhizopus, Phycomuyces, or Mucor described in WO 00/24879, I.e.. 
RCE t (SEQ ID NO: 1). RCE II (SEQ ID NO: 3), RCE 111 (SEQ ID NO: 5). MCE I (SEQ ID NO: 7), MCE II (SEQ ID NO: 
9), and PCE I (SEQ ID NO: 11 ). Locations of each of the domains in the amino acid sequence of these enzymes are 
as shown in Table 1 below. 

30 Table 1 



40 





CBD 


A portion in linker domain 


CAD 


SEQ ID NO: 1 


3 to 38 


99 to 108 


109 to 315 


SEQ ID N0:3 


3 to 38 


127 to 136 


137 to 343 




50 to 85 






SEQ ID N0:5 


3 to 40 


122 to 131 


132 to 337 


SEQ ID N0:7 


3 to 40 


104 to 113 


114 to 316 


SEQ ID NO: 9 


3 to 40 


153 to 162 


163 to 365 




52 to 89 






SEQ ID N0:11 


3 to 40 


115 to 124 


125 to 327 



[0022] Amino acid sequences at the N-terminuses of RCE I, MCE I, and PCE I are respectively identified as shown 
in SEQ ID Nos: 14, 15, and 16 (WO 00/24879). 

45 [0023] The protein according to the present invention should not comprise a cellulose-binding domain in the afore- 
mentioned endoglucanase. As long as the protein has endoglucanase activity, there is no particular limitation on the 
structure of other domains. Accordingly, the protein of the present invention may or may not comprise a linker domain. 
The protein may alternatively comprise a portion of a linker domain, and it is preferable if the protein retains a fragment 
of a linker domain comprising about 10 amino acid residues. 

so [0024] The other aspect of the present invention relates to a protein comprising any of the amino acid sequences 
as shown in SEQ ID NO: 1 (RCE I), SEQ ID NO: 3 (RCE II), SEQ ID NO: 5 (RCE III), SEQ ID NO: 7 (MCE 1), SEQ ID 
NO: 9 (MCE 11), or SEQ ID NO: 11 (PCE 1), wherein the cellulose-binding domain has been deleted therefrom, and 
exhibiting endoglucanase activity. The present invention further relates to a modified protein and a homologue of such 
protein exhibiting endoglucanase activity. 

55 [0025] In this description, the term "modified protein" refers to a protein that comprises an amino acid sequence 
having modification such as addition. Insertion, diminution, deletion, or substitution of one or several amino acids in 
the amino acid sequence of RCE I, RCE II, RCE III, MCE I, MCE II. or PCE I, which lacks the cellulose-binding domain. 
The number of the amino acids to be involved with such modification Is not particularly limited as long as the modified 
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protein has endoglucanase activity. The number thereof is preferably 1 to about 50, more preferably 1 to about 30, and 
still more preferably 1 to 9. 

[0026] The term "homologue" used herein refers to a polypeptide having an amino acid sequence coded by a gene 
encoding the amino acid sequences of RCE I, RCE II, RCE III, MCE I, MCE II, and PCE i lacking the cellulose-binding 

5 domain. One example would be a gene (nucleotide sequence) that hybridizes under stringent conditions with DNA 
having any nucleotide sequence as shown In SEQ ID NO: 2 or 13 (RCE I), SEQ ID NO: 4 (RCE II), SEQ ID NO: 6 
(RCE 111), SEQ ID NO: 8 (MCE I), SEQ ID NO: 10 (MCE II), or SEQ ID NO: 12 (PCE I), wherein a portion encoding a 
cellulose-binding domain has been removed and having endoglucanase activity. The term "stringent conditions" refers 
to conditions under which, while a probe that comprises the nucleotide sequence encoding a part or all of the amino 

10 acid sequences of RCE I, RCE II, RCE III, MCE I, MCE 11, and PCE I lacking the cellulose-binding domain or an amino 
acid sequence of its modified protein hybridizes with a gene encoding a homologue, this probe is controlled to such 
an extent that it does not hybhdize with the endoglucanase NCE4 gene (SEQ ID NO: 18) according to WO 98/03640 
or the eridoglucanase SCE3 gene (SEQ ID NO: 19) according to WO 98/54332 (wherein the amount of DNA used Is 
equal to that of the gene encoding the NCE4 gene, SCE3 gene, or a homologue). A specific example of "stringent 

15 conditions" Is as follows. A labeled probe having a full length DNA sequence encoding amino acid sequences such as 
RCE 1, which lacks the cellulose-binding domain, is used. In accordance with the method of the ECL Direct DN/V/RNA 
Labeling Detection System (Amersham), prehybridization is carried out at 42'C for 1 hour, the probe Is added, and 
hybridization is then carried out at 42*'C for 15 hours. Thereafter, 0.5x SSC (1x SSC; 15 mM trisodlum citrate, 150 mM 
sodium chloride) comprising 0.4% SDS and 6M urea is used to perform washing twice at 42°C for 20 minutes. Sub- 

20 sequently, 5x SSC is used to perfonn washing twice at room temperature (about 25''C) for 1 0 minutes. 

[0027] Examples of such modified proteins or homologue include a protein having an amino acid sequence that is 
preferably at least 70%. more preferably at least 80%, still more preferably at least 90%, even more preferably at least 
95%, and most preferably at least 98% homologous to the amino acid sequence of RCE I, RCE II, RCE 111, MCE I, 
MCE It, or PCE 1, which lacks the cellulose-binding domain. The numerical values indicating homology may be deter- 

25 mined using a program for searching for homology that Is known to a person skilled in the art. Preferably, these numerical 
values are detemilned using default (Initial setting) parameters at FASTA3 (Science, 227 , 1435-1441 (1985); Proc. 
Natl. Acad. Scl. USA, 85, 2444-2448 (1988); http://www.ddbj.nig.ac.jp/E-mail/homology-j.html ). 
[0028] The protein of the present invention does not comprise a cellulose-binding domain. Accordingly, the modified 
protein and the homologue should not comprise a cellulose-binding domain. This can be confimned by Investigating 

30 the amino acid sequence of the object protein and whether or not the consensus sequence (Hoffren, A. -M. et al., 
Protein Engineering 8: 443-450, 1995; SEQ ID NO: 17) is present therein. 

[0029] Also, the protein of the present Invention has endoglucanase activity. Accordingly, the modified protein and 
the homologue should also have endoglucanase activity. This can be confirmed by investigating endoglucanase activity 
of the object protein by the aforementioned method. 

35 [0030] The protein of the present invention can be produced as a protein comprising an amino acid sequence that 
does not comprise a cellulose-binding domain by a method known to a person skilled in the art based on the amino 
acid sequence of a known endoglucanase derived from Zygomycetes, Examples of such a method include a method 
that is canied out by decomposing In a linker domain using protease during the culture of Zygomycetes producing 
endoglucanase to cause deletion of the cellulose-binding domain and a method that is carried out by artificially ex- 

40 pressing endoglucanase that does not have a cellulose-binding domain due to genetic engineering techniques. Among 
the proteins of the present Invention, in particular, the modified protein and the homologue can be prepared using DNA 
encoding their amino acid sequences by a genetic engineering technique that is known to a person skilled in the art. 
[0031 ] The protein of the present invention can yield a higher effect than the original endoglucanase having a cellu- 
lose-binding domain in fabric treatment or other applications to detergents or paper pulp. Particularly, a much higher 

45 effect can be attained in the activity of removing fuzz from reproduced cellulose fabric such as lyocell (per protein 
weight) and in the activity of removing fuzz from cotton fabric such as knitted cotton (per protein weight). The protein 
of the present invention is preferably twice or higher, more preferably 2.5 times or higher, and most preferably 3 times 
or higher in the activity of removing fuzz from reproduced cellulose fabric (such as tyocell) (per protein weight) as a 
purified endoglucanase having the cellulose-binding domain. Or, the protein Is preferably 5 times or higher, more pref- 

50 erably 15 times or higher, and most preferably 20 times or higher in the activity of removing fuzz from cotton fabric 
such as knitted cotton (per protein weight) as a purified endoglucanase having the cellulose-binding domain. 

2. Gene, expression vector, host cell transformed with the expression vector, and production of endoglucanase lacking 
the cellulose-binding domain using the host cell 

55 

[0032] The present invention relates to a gene encoding the protein of the present invention, a modified protein 
thereof, or a homologue of the protein or the modified protein and an expression vector comprising the gene. 
[0033] The gene of the present invention may be any gene as long as it encodes the protein of the present invention , 
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a modified protein thereof, or a homologue of the protein or the modified protein. Specific details of their nucleotide 
sequences are not particularly limited. Examples of usable genes in order to express RCE I, RCE II, RCE HI, MCE I, 
MCE II, or PCE I, which lacks the cellulose-binding domain, as the protein of the present invention include those 
comprising nucleotide sequences as shown in SEQ ID NO: 2 (RCE I), SEQ ID NO: 4 (RCE II, SEQ ID NO: 6 (RCE III), 

5 SEQ ID NO: 8 (MCE I). SEQ ID NO: 10 (MCE II), or SEQ ID NO: 12 (PCE I). 

[0034] Specifically, the gene of the present Invention comprises an amino acid sequence as shown In SEQ ID NO: 
1 , 3, 5, 7, 9, or 11 , wherein the cellulose-binding domain has been deleted therefrom, and encodes a protein exhibiting 
endoglucanase activity, a modified protein thereof exhibiting endoglucanase activity, or a homologue of the protein or 
the modified protein exhibiting endoglucanase activity. 

10 [0035] The gene of the present invention comprises DNA as described in the following (a) or (b). 

(a) DNA that has a nucleotide sequence as shown In SEQ ID NO: 2 or 13, 4, 6, B, 10, or 12, wherein a portion 
encoding a cellulose-binding domain has been deleted therefrom. 

(b) DNA that hybridizes under stringent conditions with DNA that has a nucleotide sequence as shown In SEQ ID 
IS NO: 2 or 1 3, 4, 6, 8, 1 0, or 1 2, wherein a portion encoding a cellulose-binding domain has been deleted therefrom, 

[0036] The temi "stringent conditions" means the conditions described in the section 1 . above. 

[0037] The nucleotide sequences of the aforementioned genes can be optimized depending on the type of host cell 

used in the later transformation. These nucleotide sequences can be optimized with respect to, for example, the codon 

20 usage in a host cell or the Intron recognition sequence in a host cell. The codon usage can be optimized by, for example, 
modifying nucleotide sequences so as to comprise as many codons used in a host cell at high frequency as possible 
without changing the amino acid sequence to be coded. This can improve the efficiency of translation from genes into 
proteins. The intron recognition sequence can be optimized by, for example, modifying nucleotide sequences so as to 
have no DNA sequence, which could be recognized as an intron in a host cell, or to have as few sequences as possible 

25 without changing the amino acid sequence to be coded. This can improve the stability of mRNA that is a transcript of 
an object gene. The Intron recognition sequences vary depending on types of host cells. Examples of intron recognition 
sequences of filamentous fungi belonging to the Fungi imperfecti Include DNA sequences such as GTAGN, GTATN, 
GTAAN, GTACGN, GTGTN, GCACGN, and GTTCGN ("N" stands for A, T, C, or G in each sequence). 
[0038] In this description, the tenm "codon optimized gene" refers to a gene that is obtained by optimizing the codon 

30 usage and/or the intron recognition sequence. Preferably, the codon optimized gene is a gene obtained by optimizing 
the codon usage, and more preferably a gene obtained by optimizing the codon usage and the intron recognition 
sequence. This codon optimized gene is preferably optimized for the expression in filamentous fungi belonging to the 
Fungi imperfecti. Examples of such codon optimized genes include a gene which lacks a portion encoding a cellulose- 
binding domain in the codon optimized endoglucanase RCE I gene (SEQ ID NO; 13) as disclosed in WO 00/24879. 

35 [0039] The expression vector of the present invention comprises an object gene (a gene encoding the protein of the 
present Invention, a modified protein thereof, or a homologue of the protein or the modified protein) In a state that is 
replicable in a host cell and expressible of the protein of the present invention, a modified protein thereof, or a homologue 
of the protein or the modified protein. This expression vector can be constructed based on a self-replicating vector, I. 
e., a vector that exists as an extrachromosomal entity and replicates independently of the chromosome, for example, 

40 a plasmld. Altematively, the expression vector may be a vector that is integrated into the genome of the host cell upon 
introduction thereinto and replicated together with the chromosome into which It has been incorporated. For the con- 
struction of the vector of the present invention, conventional procedures and methods used In the field of genetic 
engineering can be used. 

[0040] For the expression of the protein of the present Invention, a modified protein thereof, or a homologue of the 
45 protein or the modified protein upon introduction into the host cell, it is desirable that the expression vector of the 
invention contains DNA sequences to regulate the expression and gene markers, etc. to select transformants, in ad- 
dition to the gene encoding the protein of the present invention, a modified protein thereof, or a homologue of the 
protein or the modified protein. Examples of expression regulatory DNA sequences include promoters, terminators, 
and DNA sequences encoding signal peptides. The promoters and the tenninators are not particularly limited as long 
so as they show transcription activity in the host cell. They may be obtained as DNA sequences, which control the ex- 
pression of a gene encoding a protein homogeneous or heterogeneous to the host cell. The signal peptides are not 
particularly limited as long as they contribute to the secretion of protein in the host cell. They may be obtained from 
DNA sequences derived from a gene encoding a protein homogeneous or heterogeneous to the host cell. The gene 
mari<ers of the Invention can be appropriately selected depending on the method for selecting transformants. For 
55 example, genes encoding drug resistance or genes complementing auxotrophy may be used. Each of these DNA 
sequences and gene markers is operably linked to the expression vector of the present invention. 
[0041] Further, the present invention relates to a host cell that Is transfonued with the expression vector. The ex- 
pression vector Introduced into a host cell should be replicable therein. Thus, the host cell that is used herein varies 
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depending on the type of vector used in the production of the expression vector. Alternatively, in accordance with the 
type of a host cell to be used, an expression vector can be produced so as to be replicable therein. Specifically, in 
order to obtain a transformant that expresses the protein of the present invention, a modified protein thereof, or a 
homologue of the protein or the modified protein, a host cell and an expression vector should be adequately combined. 
5 Such a combination is referred to as a host-vector system. The host-vector system that is used in the present invention 
is not particularly limited. Examples thereof Include systems using microorganisms such as Escherichia coil, Actino- 
mycetes, yeast, and filamentous fungi as host celts, and a system using filamentous fungi is preferred. An expression 
system for a fusion protein with other protein can be also used. 

[0042] When filamentous fungi are used as host cells, any type of filamentous fungi can be used, and prefen-ed 
10 examples thereof Include those belonging to Humlcola, Asperflllus, or Trichoderma. Particularly preferred examples of 
these filamentous fungi include Humicola irisolens, Aspergillus riiger or Aspergillus oryzae, and Trichoderma viride. 
[0043] A host cell can be transformed with the expression vector of the present invention in accordance with con- 
ventional methods used in the field of genetic engineering. 

[0044] The thus obtained transfomnant (transfomried host cell) is cultured In a suitable medium, and the protein of 
IS the present Invention, a modified protein thereof, or a homologue of the protein or the modified protein can be Isolated 
and obtained from the culture product. Accordingly, another aspect of the present invention relates to a method for 
producing a protein comprising steps of culturing the host cell of the present Invention and collecting the protein of the 
present invention, a modified protein thereof, or a homologue of the protein or the modified protein from the host cell 
obtained by the step of culture or a culture product thereof. Culture methods and other conditions for transfonnants 
20 may be substantially the same as those for microorganisms to be used. The transfonnants can be cultured and the 
object protein can be then collected by a conventional method of this technical field. 

3. Cellulase preparation 

25 [0045] The present invention relates to a cellulase preparation that comprises the protein of the present invention, 
a modified protein thereof, a homologue of the protein or the modified protein, or a protein produced by the method 
for producing a protein according to the present invention. 

[0046] In general, a cellulase preparation is powder, liquid, or the like that comprises, for example, an excipient (e. 
g., lactose, sodium chloride, or sorbitol), a preservative, or a nonionic surfactant, In addition to a cellulase enzyme. For 

30 example, it Is fonnulated as a powdery, particulate, granular, non-dusting granular, or liquid preparation. The cellulase 
preparation of the present invention comprises, as the cellulase enzyme, the protein of the present invention, a modified 
protein thereof, a homologue of the protein or the modified protein, or a protein produced by the method for producing 
a protein of the present invention (hereinafter referred to as 'Ihe proteins of the present invention"). Further, the cellulase 
preparation of the present Invention may comprise, in addition to the proteins of the present invention, other cellulase 

35 enzymes, for example, cellobiohydrolase, p-glucosidase, or endoglucanase which are not involved in the present in- 
vention. 

[0047] One type of cellulase preparation, a non-dusting granular preparation, may be prepared by a conventional 
dry granulating method. Specifically, the powdery proteins of the present invention are mixed with one or several of: 
neutral inorganic salts that do not affect endoglucanase activity represented by sodium sulfate, sodium chloride, or the 

40 like; minerals that do not affect endoglucanase activity represented by bentonite, montmorillonite, or the like; or neutral 
organic substances represented by starch, particulate cellulose, or the like. A powder or fine suspension of one or 
several nonionic surfactant is then added thereto, followed by thorough mixing or kneading. Depending on the situation, 
a synthetic polymer represented by polyethylene glycol for binding solid matter or a natural polymer such as starch is 
suitably added and further kneaded. Thereafter, extrusion granulation is carried out using, for example, Disc Pelleter, 

45 and the extruded granules are then shaped into spherical form using the Mammerizer, followed by drying. Thus, non- 
dusting granules can be produced. The amount of one or several nonionic surfactants to be added is not particularly 
limited. The amount Is preferably 0.1% to 60% by weight, more preferably 0.1% to 30% by weight, and still more 
preferably 1% to 20% by weight, based on the entire cellulase preparation according to the present invention. Oxygen 
permeation or water pemieation can be regulated by coating the surfaces of granules with a polymer, etc. 

50 [0048] In contrast, a liquid formulation can be prepared by incorporating a stabilizer for the endoglucanase enzyme 
such as a synthetic polymer, a natural polymer, orthe like into a solution comprising the proteins of the present invention 
and adding inorganic salts or synthetic preservatives according to need. In this case, oneorseveral nonionic surfactants 
can be also incorporated. The amount of one or several nonionic surfactants to be added is not particularly limited. 
The amount is preferably 0.1% to 50% by weight, more preferably 0.1% to 30% by weight, and stilt more preferably 

55 1% to 20% by weight, based on the entire cellulase preparation according to the present invention. 
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4. Application of the proteins of the present invention and the cellulase preparation of the present invention 

[0049] The present invention relates to a method for treating cellulose-containing fabrics. This method comprises a 
step of bringing cellulose-containing fabrics into contact with the proteins of the present invention or the cellulase 
5 preparation of the present invention. Conditions such as contact temperature or amounts of the proteins or the cellulase 
preparation can be suitably determined with respect to various other conditions. 

[0050] The aforementioned method can be used to reduce the rate at which cellulose-containing fabrics become 
fuzzy or to reduce fuzzing In cellulose-containing fabrics. In this application, the proteins or the cellulase preparation 
at the protein concentration of 0.001 to 1 mg/l is preferably used at about 30*0 to 60**C. 
10 [0051 ] The aforementioned method can be used in weight loss treatment of cellulose-containing fabrics to improve 
the feel and appearance thereof. In this application, to improve the feel means to reduce the rate at which the feel is 
spoiled. In this application, the proteins or the cellulase preparation at the protein concentration of 0.001 to 100 mg/l 
is preferably used at about 30° C to BCC. 

[0052] The aforementioned method can be used for color clarification of colored cellulose-containing fabrics. 
IS [0053] The aforementioned method can be used for providing colored cellulose-containing fabrics with partial color 
change. In this application, for example, colored cellulose-containing fabrics (e.g., denim) can be provided with an 
appearance of stone-washed material. In this application, the proteins or the cellulase preparation at the protein con- 
centration of 0.01 to 100 mg/l is preferably used at about 40*C to 60'C. 

[0054] The protein concentrations of various endoglucanases were calculated from the peak area at UV 280 nm of 

20 respective endoglucanase eluted with a linear gradient from 0% to 80% of acetonitrlle concentration in 0.05% TFA 
(trifluoroacetic acid) at a flow rate of 1 .0 mi/min in HPLC analysis using TSK gel TMS-250 column (4.6 mm I.D. x 7.5 
cm, TOSOH Japan). The standard used was the purified NCE4, which was analyzed in HPLC under the same condi- 
tions, the protein concentration of which had been preliminarily measured by a protein assay kit (BioRad Laboratories). 
The purified NCE4 is purified from a culture product of Humicota insolens MN 200-1, which was deposited at the 

25 International Patent Organism Depositary of the National Institute of Advanced Industrial Science and Technology 
(Tsukuba Central 6, 1-1-1 Higashi, Tsukuba, ibaraki, Japan) under the accession number of PERM BP-5977 (initial 
deposition: PERM P-15736, date of initial deposition: July 15, 1996), in accordance with the method as described In 
WO 98/03640. The standand used to measure the protein concentration using the protein assay kit is the Albumin 
Standard (Bovine serum albumin, fraction V, PIERCE). 

30 [0055] The aforementioned method can be used to reduce the rate at which cellulose-containing fabrics become stiff 
or to reduce stiffness in cellulose-containing fabrics. In this application, cellulose-containing fabrics can be softened. 
[0056] In the aforementioned applications, cellulose-containing fabrics can be treated through soaking, washing, or 
rinsing of the fabrics. Specifically, the aforementioned method of the present invention can be carried out by treating 
cellulose-containing fabrics during washing. On some occasions, however, the treatment of fabrics may be carried out 

35 during soaking or rinsing by adding the proteins or the cellulase preparation of the present invention into water where 
the fabrics are soaked or to be soaked. 

[0057] The present invention relates to an additive to detergent comprising the proteins or the cellulase preparation 
of the present invention in a non-dusting granular form or a stabilized liquid forni. The present Invention further relates 
to a detergent composition comprising the proteins or the cellulase preparation of the present Invention. 

40 [0058] This detergent composition may also contain a surfactant (which may be anionic, nonionlc, cationic, ampho- 
teric, or zwltlerlonic surfactant, or a mixture thereof). Further, this detergent composition may contain other detergent 
components known in the art, such as builders, bleaching agents, bleaching activators, corrosion inhibitors, seques- 
tering agents, soil-dissociating polymers, aromatics, other enzymes (e.g.. protease, lipase, or amylase), enzyme sta- 
bilizers, formulation assistants, fluorescent brightening agents, foaming promoters, etc. Examples of representative 

45 anionic surfactants include linear alkyl benzene sulfonate (LAS), alkylsulfate (AS), a-olefln sulfonate (AOS), polyox- 
yethylene alkyl ether sulfate (AES), a-sulfonato fatly acid methyl ester (a-SFMe), and alkali metal salts of natural fatty 
adds. Examples of nonionlc surfactants Include polyoxyethylene alkyl ether (AE), alkyl polyethylene glycol ether, no- 
nytphenol polyethylene glycol ether, fatty acid methyl ester ethoxylate, fatty acid esters of sucrose or glucose, and 
esters of alkyl glucoside and polyethoxylated alkylglucoside. 

so [0059] The use of the proteins or the cellulase preparation of the present invention in a detergent composition can 
improve perfomiances regarding particulate soil removal, color clarification, fuzz prevention, depillrng, and reduction 
of stiffness. 

[0060] The present invention relates to a method of delnking waste paper using a deinking agent wherein the proteins 
or the cellulase preparation of the present invention is used in a step of deinking waste paper with a deinking agent. 
55 [0061] When the protein or the cellulase preparation of the present invention Is acted on waste paper, the efficiency 
of deinking is enhanced, and thus, the protein or the cellulase preparation of the present invention Is useful In the 
process of manufacturing recycled paper from waste paper This deinking method can significantly decrease Ink-re- 
maining fibers. Therefore, the whiteness of the waste paper can be enhanced. 
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[0062] The aforementioned "deinking agent" is not particularly limited as long as it Is a commonly used agent when 
deinking waste paper. Examples thereof include alkali such as NaOH or NaaCOg, sodium silicate, hydrogen peroxide, 
phosphates, anionic surfactants, nonionic surfactants, capturing agents such as oleic acid, and examples of aids in- 
clude pH stabilizers, chelating agents, and dispersants. 

5 [0063] Waste paper, which can be treated by the above deinking method, is not particularly limited as long as it can 
be generally referred to as waste paper. Examples of waste paper Include: used printed paper containing mechanical 
pulp and chemical pulp such as used newspaper, used magazine paper and low-grade or middle-grade used printed 
paper, used wood-free paper composed of chemical pulp; and coated paper thereof. Further, the above deinking method 
can be applied to any paper on which ink has been deposited if the paper Is not generally referred to as waste paper. 

10 [0064] Further, the present Invention relates to a method for improving drainage of paper pulp. This method comprises 
a step of treating the paper pulp with the proteins or the cellulase preparation of the present invention. 
[0085] According to this method, drainage of paper pulp can be remarkably Improved without significant deterioration 
in paper strength. Pulp, which can be treated by this method. Is not particularly limited, and examples thereof Include 
waste paper pulp, recycled board pulp, kraft pulp, sulfite pulp, processed or thermo-mechanical pulp, and other high- 

is yield pulp. 

[0066] The present invention further relates to a method for improving the digestibility of animal feeds. This method 
comprises a step of treating animal feeds with the proteins or the cellulase preparation of the present invention. 
[0067] According to this method, molecular weights of glucans in animal feeds are suitably lowered. Thus, the di- 
gestibility of animal feeds can be improved. 
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5. Deposition of microorganisms 



[0068] The Rhizopus oryzae CP96001 strain was deposited at the International Patent Organism Depositary of the 
National Institute of Advanced Industrial Science and Technology (Tsukuba Central 6.1-1-1 Higashi, Tsukuba. Ibaraki 
25 Japan) under the accession number of PERM BP-6889 on April 21, 1997. 

[0069] The Mucor circinelbides CP99001 strain was deposited at the International Patent Organism Depositary of 
the National Institute of Advanced Industrial Science and Technology (Tsukuba Central 6. 1-M Higashl. Tsukuba, 
Ibaraki, Japan) under the accession number of PERM BP-6890 on July 2, 1 999. 

[0070] The Phycomyces nitens CP99002 strain was deposited at the International Patent Organism Depositary of 
the National Institute of Advanced Industrial Science and Technology (Tsukuba Central 6, 1-1-1 Higashi. Tsukuba. 
Ibaraki, Japan) under the accession number of PERM BP-6B91 on July 2. 1 999. 

[0071] The Escherk:hia coliM 109 strain that was transfomied with the expression vector pMKDOl used in the 
present invention was deposited at the International Patent Organism Depositary of the National Institute of Advanced 
Industnal Science and Technology (Tsukuba Central 6, 1-1-1 Higashi. Tsukuba. ibaraki. Japan) under the accession 
number of PERM BP-5974 (initial deposition: PERM P-15730. date of initial deposition: July 12. 1996). 
[0072] The Humicola insolens MN 200-1 strain that can be a host for the expression vector of'the present Invention 
was deposited at the International Patent Organism Depositary of the National Institute of Advanced Industrial Science 
and Technology (Tsukuba Central 6, 1-1-1 Higashi, Tsukuba, Ibaraki, Japan) under the accession number of PERM 
BP-5977 (initial deposition: PERM P-15736, date of initial deposition: July 15, 1996). 

[0073] This description Includes part or all of the contents as disclosed In the description of Japanese Patent Appli- 
cation No. 2000-354296, which Is a priority document of the present application. 

Best Modes for Canrying out the Present Invention 

[0074] The present invention Is described In more detail with reference to the following examples and reference 
examples but is not limited thereto. 

[0075] In the following description, "endoglucanase activity" refers to CMCase activity. Further. "CMCase activity" 
measured by the amount of the reducing sugars released after the incubation of a cellulase enzyme and a carboxymeth- 
ylcellulose (CMC, Tokyo Kasei Kogyo. Japan) solution for a given period of time and one unit Is defined as the amount 
of an enzyme that produces reducing sugars con-esponding to 1 nmol of glucose per minute. 
[0076] In implementing the following tests, examples disclosed in WO 98/03667 and WO 00/24879 were referred to 
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[Reference Examples] 

Reference Example 1: cDNA cloning of cellulase NCE5 
(1) Isolation of cDNA and preparation of library 



(i) Preparation of total RNA 



me] Humicola/nsotensMN 200-1 (PERM BP.5977) was cultured in medium (N) (5.0% Avicel 2 0% yeast extract 
0.1 % polypeptone, 0.03% calcium chloride, 0.03% magnesium chloride dH S m for !> ri^S aZ' ,. T ' 

by centmuga^n (3,500 rpm, 1 0 minutes). Among the colilt s Tr^^ l JZ:^T:ZS:^^^ 
rozen .n Nu.d nitrogen, and then ground in liquid nitrogen using a mortJand a pe^^.^a, rna Lti 11 

conZed b^tr::;^'^ J^'"* ^^'^^^ ^"^^ ^"^^^ a«ached l eTand o^rRS^ 

confimied by fomialdehyde agarose gel electrophoresis as a chromatic figure. 

(li) Preparation of poly(A)tail+RNA (= mRNA) 

(iii) Synthesis of cDNA 

(iv) Preparation of cDNA library 

(2) Amplification and analysis of DNA by PGR 

[0083] The following synthetic oligonucleotides were prepared as primers 
N-tem,inal: 5'-TAY TGG GAY TGY TGY AAR CC-3" (20mer) (SEQ ID NO- 36) 
T-43.0:5'-TCI GCR TTI ARR AAC CAR TC-3' (20mer) (SEQ ID NO: 37) ' 

(In these nucleotide sequences. R indicates G or A; Y Indicates T or C; and I Indicates inosine.) 
sequence.wh.hwasdeducedfromthedetermlnednucteotidesequence,contaLSU^^^^^ 
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es of cellulase NCE5. Thus, the deduced amino acid sequence was used as a probe in the following screening process. 

(3) Cloning of the gene that encodes NCE5, the cellulase component 
(i) Screening by plaque hybridization 

[0086] The 500 bp DNA fragment (1 00 ng) amplified by PGR was previously labeled with the ECL Direct DNA/RNA 
Labeling Detection System (Amersham Phamiacia Biotech). 

[0087] The phage plaque prepared in (1 )-(iv) was transferred to the Hybond-N-f. nylon transfer membrane (Amersham 
Pharmacia Biotech) and subjected to all<ali treatment with 0.4N sodium hydroxide. Recombinant phage DNA on the 
membrane was denatured into a single strand. Thereafter, the nylon membrane was washed with 5x SSC {1x SSC: 
15 mM trisodium citrate, 150 mM sodium chloride) and then air dried to immobilize DNA thereon. Thereafter, the nylon 
membrane was hybridized in accordance with the manual of the kit, detected, and exposed to Fuji Medical'x-ray film 
(Fuji Photo Film) to yield 6 positive clones. 

(11) Preparation of phage DNA 

[0088] DNA was prepared from the positive clones as plasmid DNA in accordance with the manual attached to the kit. 
[0089] A plasmid in which a DNA fragment was cloned into pBluescript SK(-) was prepared from amplcillin-resistant 
Escherichia coll SOLRtm. Using this plasmid as a template, PGR was carried out using the N-terminal and T-43.0 
primers used in (2) under the same conditions as described above. As a result, a 500 bp amplification product was 
obtained from one plasmid. Thus, it Is deduced that the object DNA was cloned into this plasmid. This plasmid was 
digested with EcoRI and then subjected to agarose gel electrophoresis. 
[0090] As a result, the plasmid was found to contain about 1 kbp EcoRI fragment. 

(4) Detennination of cDNA nucleotide sequence 

[0091] A nucleotide sequence of the about 1 kbp EcoRI fragment that was inserted Into a positive recombinant 
pBluescript SK(-) plasmid obtained in (3)-(ll) was determined in the same manner as described above using primers 
for sequencing T3 and T7. As a result, this nucleotide sequence was found to contain a 672 bp open reading frame 
(ORF). Amino acid sequences deduced from the nucleotide sequence and the ORF were shown in SEQ ID NO- 39 
and SEQ ID NO: 38 in the Sequence Listings. 

[0092] Further, 1 to 1 8 amino acid sequences in this ORF were considered to be signal sequences for secreting the 
protein extracellularly. 

Reference Example 2 : Expression of NCE5 gene In Humicota Insolens 

[0093] Plasmid pJDOl (see Example D1 (2) (b) In WO 00/24879) was used as an expression vector in Humicola 
insolens MN 200-1 (PERM BP-5977) and constructed In the following manner. 

(1) Construction of NCE5 expressing plasmid pJND-c5 

(I) Site-directed mutagenesis into NCE5 gene 

[0094] In order to ligate the NCE5 gene to the SamHI site of plasmid pJD01 , a primer was constructed so as to 
previously comprise BamHl site in a sequence Immediately upstream of the initiation codon and that immediately 
downstream of the termination codon, and the NCE5 gene was amplified by PGR. Primers for mutagenesis were 
designed as shown below. 
NCE5-N-BamHI: 



5'-GGGGATCCrcCK}ACAAGATGCACKnXX(XCTGACCAC^^ OSmw) (SEQ ID NO: 40); 
NCES-C-BamHI: 

5'<5GGGATCX7rGCAmAACGCGAGCAGCCGCIXn^ (38iner) (SEQ ID NO: 41). 
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[0095] PGR was carried out under the same conditions as described above using the positive recombinant pBlue- 
script SK(-) plasmid obtained in Reference Example 1 as a template. As a result, an about 670 bp amplification product 
of the DNA fragment was confimied by 1.0% agarose gel electrophoresis. Unreacted matter was removed by the 
MicroSpin S-400 HR Columns (Amersham Phamiacia Biotech), precipitated with ethanol, and then digested with Bam- 
HI Subsequently, a total amount was subjected to 1 .0% agarose gel electrophoresis, a 670 bp DNA fragment was 
collected using the Sephaglas BandPrep Kit (Amersham Phamnacia Biotech) In accordance with the attached manual, 
and the BamHI fragment was subcloned into the fiamHI site of plasmid pUC11 8 to yield plasmid pNCESBam. Further, 
the nucleotide sequence of this inserted fragment was detenmrned and confimied by the aforementioned method. 

(ii) Preparation of plasmid pJND-c5 

[0096] The above plasmid pJD01 was digested with BamHI and separated by 0.8% agarose gel electrophoresis. An 
about 8.0 kbp DNA fragment was collected by the Sephaglas BandPrep Kit, and the collected DNA fragment was 
dephosphorylated using Escherichia coii-derlved alkaline phosphatase (Takara Shuzo Co., Ltd.) in accordance with 
the attached manual. Similarly, the plasmid pNCESBam obtained in (i) was digested with BamHI, and 670 bp DNA 
fragments were collected and iigated to the above DNA fragment using the DNA Ligation Kit Ver. 2 to obtain expression 
plasmid pJND-c5. 

(2) Transfomiation of Humicola insolens with plasmid pJND-c5 

[0097] Humicola insolens MN 200-1 (FERM BP-5977) was cultured in medium (S) at 37''C for 24 hours, and then 
the cells were collected by centrifugatlon at 3,000 rpm for 1 0 minutes. Medium (S) is composed of medium (N) described 
in Reference Example 1 having glucose (3.0%) added thereto and Avicei removed therefrom. The collected cells were 
washed with 0.5M sucrose and suspended in 10 ml of enzyme solution for preparing protoplast (3 mg/ml p-glucuroni- 
dase, 1 mg^mi Chitinase. 1 mg/ml Zymoiyase. and 0.5M sucrose) filtered through a 0.45 fim filter. The suspension was 
shaken at 30*C for 60 to 90 minutes to render fungal threads to be protoplasted. This suspension was filtered and then 
centrif uged at 2,500 rpm for 1 0 minutes, and the protoplast was collected and then washed with a SUTC buffer (0.5M 
sucrose, 10 mM calcium chloride, and 10 mM Tris hydrochloride (pH 7.5)). 

[0098] The thus prepared protoplast was suspended in 1 mL of SUTC buffer, and 10 jil of DNA (TE) solution was 
added to the suspension in amounts of 1 0 jig per each 1 00 \l\ of the suspension. The mixture was allowed to stand in 
ice for 5 minutes. Subsequently, 400 jil of PEG solution (60% PEG 4000, 10 mM calcium chloride. 10 mM Tris hydro- 
chloride (pH 7.5)) was added and the mixture was allowed to stand in ice for 20 minutes. Thereafter. 10 ml of SUTC 
buffer was added, and centrif ugation was carried out at 2,500 rpm for 1 0 minutes. The collected protoplast was sus- 
pended in 1 ml of SUTC buffer, centrifuged at 4,000 rpm for 5 minutes, and finally suspended in 1 00 ^1 of SUTC buffer. 
[0099] The protoplast treated as above was superposed on the hygromycln-added (200 jig(ml) YMG medium (1% 
glucose, 0.4% yeast extract, 0.2% malt extract, 1% agar (pH 6.8)) together with YMG soft agar. Culture was incubated 
at 37°C for 5 days. Thereafter, the generated colony was detemiined to be a transformant. 

[Example 1] Construction of gene expressing RCE I variant H43. which lacks the cellulose-binding domain 

[0100] Plasmid p18-1 containing the codon optimized endoglucanase gene RCE I (see Example 03 (1) g) in WO 
00/24879) was digested with the restriction enzyme BamHI to prepare plasmid pRI H4 wherein a fragment containing 
the codon optimized endoglucanase gene has been cloned into the BamHI site of plasmid pUCII 8. This plasmid p R1 H4 
was used as a template to perform first-phase PCR using two synthetic DNAs. i.e., RC-43F and RC-43R, as primers 
and the TaKaRa LA PCR in vitro Mutagenesis Kit (Takara Shuzo Co.. Ltd.). Reaction conditions were in accordance 
with the manual attached to the kit, A sample was separated after the reaction by agarose gel electrophoresis to obtain 
an about 650 bp gene fragment 43-X2. 

[0101] Using plasmid pJND-c5 as a template, which comprises the NCE5 gene originally having no cellulose-binding 
domain and being a family 46 endoglucanase as described in Reference Example 2, first-phase PCR was similarly 
carried out using two synthetic DNAs. i.e., NX-43F and NX-43R, as primers and the TaKaRa LA PCR in vitro Muta- 
genesis Kit (Takara Shuzo Co., Ltd.). Reaction conditions were in accordance with the manual attached to the kit. A 
sample was separated after the reaction by agarose gel electrophoresis to obtain an about 1 20 bp gene fragment 43-X1 . 



RC-43F: CACXACCKZGCTACrGGGACT (SEQ ID NO: 20); 
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RC-43R: GGArcCIXK:cnTTAClTGC (SEQ ID NO: 21); 



NX-43F: GGATCCTGGGACAAGATG (SEQ ID NO: 22); 



NX-43R: GCACGACXXSCTTGCAGC (SEQ ID NO: 23) 

[0102] Annealing and second-phase PGR were carried out using PGR fragments 43-X1 and 43-X2 and the TaKaRa 
LA PGR in vitro Mutagenesis Kit. Two synthetic DMAs, i.e., NX-43F and RC-43R, were used as primers and reaction 
conditions were in accordance with the manual attached to the l^it. A sample was separated after the reaction by 
agarose gel electrophoresis to obtain an about 700 bp gene fragment H43. This fragment was digested with the re- 
striction enzyme BamH\ to prepare plasmid pR1H43 ligated to the BarriHl site of plasmid pUG118 using the TaKaRa 
DNA Ligation Kit Ver. 1 . Reaction conditions, such as those regarding enzymes, were in accordance with the manual 
attached to the kit The obtained plasmid pR1 H43 was subjected to sequencing using the Cy5-Auto Read Sequencing 
Kit (Amersham Phannacia), and the sequence was analyzed using the DNA Sequencer ALFred (Amersham Pharma- 
cia) . A primer used in the reaction was the M 1 3 primer, which was attached to the kit. As a result, It was confirmed that 
the sequence was In the expected form. i.e.. a secretion signal on the N-terminal side was derived from NCE5 and the 
remaining catalytic active domain (GAD) was a sequence derived from ROE I. The amino acid sequence of the RCE 
I variant H43 deduced from the nucleotide sequence is shown below. 



H43: MQlJPLrriXTm>ALAAAQSGSGRrmYWDCCKPSC^ 

VTAI^DSNAQSGCNGGNSYMCNDNQPWAVNDNIJVY 

reTSVAGKKMWQVTWGGDIXr^^ 

YGGISSASDCSS1J>SAU3AGCKWRFNWFKNADNPSMTYKEV^ 
(SEQ ID NO: 24) 

[0103] In this amino acid sequence {SEQ ID NO: 24). amino acid residues 1 to 18 are signal peptides derived from 
NCE5. amino acid residues 1 9 to 24 are N-temiinal sequences of LACES, amino acid residues 25 to 36 are sequences 
derived from NCE5 or RGE I, and amino acid residue 37 and succeeding sequences are derived from the catalytic 
active domain of RGE I. 

[0104] When the reaction Is carried out in accordance with the above method, two types of sequences i e the 
sequence as shown In SEQ ID NO: 24 and the sequence In which amino acid residue 34 Is alanine, are obtained as 
amino acid sequences of the protein coded by the nucleotide sequence of the gene fragment H43'. in the following 
procedure, a gene fragment used has a nucleotide sequence, which encodes the amino acid sequence as shown in 
SEQ ID NO: 24. 

[Example 2) Gonstruction o f gene expressing RGE I variant H45 which lacks the cetlulose-binding domain 

[0105] pR1H4 obtained in Example 1 was used as a template to Introduce amino acid substitution into the RCE I 
gene using synthetic DNA. RG-A121P, and the TaKaRa LA PGR in vitro Mutagenesis Kit (Takara Shuzo Co Ltd ) 
Reaction conditions were in accordance with the manual attached to the kit. A sample was separated after the reaction 
by agarose gel electrophoresis to obtain an about Lkbp gene fragment L9. This fragment was digested with the re- 
striction enzyme BamH\ and ligated to the BamH\ site of plasmid pUG118 using the TaKaRa DNA Ligation Kit Ver 1 
to prepare plasmid pRIL9. Reaction conditions, such as those regarding enzymes, were In accordance with the manual 
attached to the kit. The obtained plasmid pR1L9 was subjected to sequencing reaction using the Cy5-Auto Read 
Sequencing Kit (Amersham Pharmacia), and the sequence was analyzed using the DNA Sequencer ALFred (Amer- 
sham Pharmacia). A synthetic DNA used as primer used in this reaction was H4-RI. As a result, it was confimied that 
the sequence was in the expected form, I.e., one alanine was substituted with proline. 
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RC-A121P: GACTGClXSCAAGCrGTCXJTGC (SEQ ID NO; 42); 

H4-R1: GTrGCACATGTACJOAGlTGC (SEQ D) NO: 43) 

[0106] Using this pR1L9 as a template, a gene encoding a portion of the secretion signal sequence in the RCE I 
gene was amplrf,ed. PCR was carried out using two synthetic DMAs, i.e., RC.451F and RC-451R p^ers thj 
TaKaRa Ex Taq Polymerase (Takara Shuzo Co., Ltd.), and attached buffers. The composition of the eacSJn so^t on 

ZZ ?So?r"? T T''°" '''''''' '-P-^"- condit! ns or e r^^^^^ 

Cycle (2400.R, Perkin Elmer) were 25 cycles of 94'C for 1 minute. 50»C for 2 minutes, and 72'C for 1 minute A 

sample was separatedafterthereactionbyagarosegelelectrophoresist^ 

R1 L as LTmoirp'cR '^l' T <^'^°) °' ' ampllfld u ing the 

S-L Fv t!„ P f ■ TJ^""!^ """3 'y"*^"^ "^NAs, l.e., RC-452F and RC452R, as primers the 
TaKaRa Ex Taq Potyirierase (Takara Shuzo Co.. Ltd.), and attached buffers. The composition of the reaction soL on 

r ! f^n^^o"^*"'^ ♦•'^ ^""^^^ "^^""^l- The temperlre conditions for t e Thermal 

Cycler (2400-R. Perkin Elmer) were 25 cycles of 94'C for 1 minute. 50»C for 2 minutes, and 72»C fo 1 mlutri 

sample was separatedafterthe reaction by agarosegelelectrophoreslstoobtalnanaboute^ bp genefr^grnr4^^^^^^ 
RC-45 JF: GCXSGArCCTGGGACAAGArO (SEQ ID Na 25); 



R&451R: GCCTGCAOAGCGGCGOAGGCCArc (SEQ ID NO: 26); 
RC452F: GCCTGCAGOGAAAQTACAGCX3CTGT (SOSQ ID NO: 27); 

RC-452R: GCGGATCCTGCOTITACrraC CSEQ ID NO: 28) 

X 1° "1 n^r/fT^"'* "^^^ ^'^ "^'^^ '''9^'^^'^ restriction enzyme Psfl, ligated toqether usino the 

Tat^RaDNA Ligation KitVerl.and then dlgestedwith the restriction enzyme eamHItoobteinaZSm^^^^^ 
PCR was earned out us.ng 45.X3 as a template, two synthetic DNAs, i.e., RC-451F and RC-S.^^primei S 

wf. in « '""^ T'^^ ^-""Po^""" °' reaction soT£ 

was ,n acco|^ance with the conditions specified in the attached manual. The temperature conditions for the Thermal 
Cycler (2400-R. Perkin Elmer) were 25 cycles of 94-C for 1 minute. 50'C for 2 minutes, and 72-C fo 1 mliute a 

sample was separated afterthereactlonby agarose gel electrophoresis to obtain an about 700bpgenef^rnrm^^ 
ThisfragmentwasAgest^ 

manua^r J H. T '''' ' '° P"^"^" ^^'^ accordance wth e 

TsholbSw 



H45: 



MKITnASSAUAL\IX3miASAAU3GKySAVSGGASGNGVTOYWDCCK^^ 



PGKANVSSPVKS<mDGVTALSDSNAQSGCNGGNSYMCNDNQPWAVNDNLAYGFAAA 
AISGGGESRWCCSCFELIPI^SVAGKKMWQVTMGGDUSSSK^ 

NGCSSQWGAPNIX3WGSRYGGISSASDCSSli«AIX2AGaCWRENViTKNADNPSM^^ 
TCPKETTAKTGCSRK (SEQ ID NO: 29) 

[0109] m this amino acid sequence (SEQ ID NO: 29). amino acid residues 1 to 23 are signal peptides derived from 
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RCE1, amino acid residue 24 is an N-terminal sequence of RCE I, amino acid residues 25 and 26 are sequences 
introduced by a primer used, and amino acid residue 27 and succeeding sequences are derived from the catalytic 
active domain of RCE I. 

[Example 3) Expression of codon optimized endoglucanase RCE I gene and Its variants RCE I-H43 and H45. which 
lacl< the celluloses-binding domains, in Humicola insolens ' ' 

[0110] Plasmid pJD01 (see Example D1 (2) (b) in WO 00/24879) was digested with BamHI and dephosphorylated 
using Escherichia co/f-derived alkaline phosphatase (Takara Shuzo Co.. Ltd.) in accordance with the attached manual. 
Plasmids pR1 H43 and pR1H45 obtained in Examples 1 and 2 respectively were also digested with eamHI to produce 
about 700 bp DNA fragments. The obtained fragments were ligated respectively with aforementioned plasmid pJDOl 
using the DNA Ligation Kit Ver. 1 to produce expression plasmids pJND-H43 and pJND-H46. 
[0111] Humicola insolensm200'^ (FERM BP-5977) wastransfonnedwithpJND-H43.pJND-H45. orpJI4D01 com- 
prising the codon optimized RCE I gene (plasmid comprising a codon optimized RCE I gene to express the Rhlzopus- 
derived RCE I In Humicola, see Example D3 (3) In WO 00/24879). Specifically, Humicola insolens MN 200-1 (FERM 
BP^977) was cultured In medium (S) (3.0% glucose. 2.0% yeast extract. 0.1% polypeptone. 0.03% calcium chloride, 
0.03% magnesium chloride (pH 6.8)) at 37°C for 24 hours, and then the cells were collected by centrifugation at 3,000 
rpm for 10 minutes. The collected cells were washed with 0.5M sucrose and suspended in 10 ml of enzyme solution 
for preparing protoplast (5 mg/ml Novozyme 234 (Novo), 5 mg/ml Cellulase Onozuka R-1 0 (Yakult), and 0.5M sucrose) 
filtered through a 0.45 \im filter. The suspension was shaken at 30»C for 60 to 90 minutes to render fungal threads to 
be protoplasted. This suspension was filtered and then centrlfuged at 2,500 rpm for 10 minutes, and protoplast was 
collected and washed with a SUTC buffer (0.5M sucrose, 10 mM calcium chloride, and 10 mM Tris hydrochloride (pH 
7.5)). 

[0112] The thus prepared protoplast was suspended in 1 mL of SUTC buffer, and 1 0 ^il of DNA (TE) solution was 
added to the suspension in amounts of 1 0 ^g per each 1 00 ^1 of the suspension. The mixture was allowed to stand in 
ice for 5 minutes. Subsequently, 400 ^1 of PEG solution (60% PEG 4000, 10 mM calcium chloride, and 10 mM Tris 
hydrochloride (pH 7.5)) was added and the mixture was allowed to stand in ice for 20 minutes. Thereafter. 10 ml of 
SUTC buffer was added, and centrifugation was carried out at 2.500 rpm for 1 0 minutes. The collected protoplast was 
suspended in 1 ml of SUTC buffer, centrifuged at 4,000 rpm for 5 minutes, and finally suspended in 100 ul of SUTC 
30 buffer. *^ 

[0113] The protoplast treated as above was superposed on a 200 ^g/ml hygromycin B-containing YMG medium (1% 
glucose. 0.4% yeast extract, 0.2% malt extract, 1 % agar (pH 6.8)) together with YMG soft agar. Culture was Incubated 
at 37°C for 5 days. Thereafter, the generated colony was detemnined to be a transformant. 
[01 14] The obtained transfomiant was cultured in medium (N) (5.0% Avicel, 2.0% yeast extract. 0.1% polypeptone 
0.03% calcium chloride, 0.03% magnesium chloride, pH 6.8) at 37'»C. A culture supernatant from which solid matter 
has been removed by centrifugation was determined to be an enzyme sample. 

[Example 4] Isolation and purification of RCE I variant from Humicola /nso/ens transformant 

[0115] Humicola insolens transformants were inoculated to medium (N) (5.0% Avicel, 2.0% yeast extract 0 1% 
polypeptone, 0.03% calcium chloride, 0.03% magnesium chloride. (pH 6.8)) and subjected to shake culture at 37''C 
Transformants Into which plasmids pJND-H43 and pJND-H45 had been introduced were cultured for 5 to 6 days. 
Regarding transformants into which plasmid pJI4D01 had been introduced, a sample was cultured for 4 days for ob- 
taining an RCE I enzyme, which was not degraded in its linker domain, and which sustained a cellulose-binding domain 
In contrast, a sample was cultured longer than 4 days. I.e.. for 5 to 6 days, for obtaining the RCE I enzyme, which was 
degraded In its linker domain and lacked the cellulose-binding domain. Each of the resulting culture solutions was 
centrifuged at 7.000 rpm for 20 minutes to remove cells, and the culture supernatant was determined to be a crude 
cellulase preparation. 

[01 16] An ammonium su If ate solution at a final concentration of 1 .5 M was prepared from 1 00 ml of this crude cellulase 
preparation and then applied at a flow rate of 10.0 ml/min to Macro-Prep Methyl HIC Support hydrophobic chromatog- 
raphy (270 ml in gel volume, BioRad Laboratories) which had been previously equilibrated with 1.5 M ammonium 
sulfate solution. It was then fractionated by eluting at a flow rate of 10.0 ml/min in a stepwise elution method in which 
the concentration of ammonium sulfate was decreased by 0.3 M each from 1.5 M. Fractions found to have strong 
activity of removing fuzz from lyocell were: a fraction obtained at an ammonium sulfate concentration of 1 2 M regarding 
a culture solution of the transformant into which plasmid pJND-H43 had been introduced (hereinafter referred to as an 
"H43 culture solution"); a fraction obtained at an ammonium sulfate concentration of 0.9 M regarding a culture solution 
of the transfomiant into which plasmid pJND-H45 had been introduced (hereinafter referred to as an "H45 culture 
solution"); a fraction obtained at an ammonium sulfate concentration of 0.6 M regarding a culture solution of the trans- 
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formant into which plasmid pJi4D01 had been introduced and cultured for 4 days (hereinafter referred to as an "H4 
culture solutton"); and a fraction obtained at an ammonium sulfate concentration of 0.9 M regarding a culture solution 
of the transfomiant Into which plasmid pJI4D01 had been Introduced and for which a culture period had been extended 
to accelerate its degradation (hereinafter referred to as an ■H4 degradation product"). Therefore, 1 00 ml each of these 

fractions was fractionated. 

[01 1 7] An ammonium sulfate solution at a final concentration of 1 .5 M was prepared from 1 00 ml of the obtained 
active fractions and then applied again at a flow rate of 10.0 ml/min to Macro-Prep Methyl HIC Support hydrophobic 
chromatography (270 ml In gel volume, BioRad Laboratories) which had been previously equilibrated with 1 5 M am- 
monium sulfate solution. It was then fractionated by eluting at a flow rale of 10.0 ml/min in a stepwise elution method 
in which the concentration of ammonium sulfate in deionized waterwas decreased by 0,1 5 M each from 1 5 M Fractions 
found to have activities of removing fuzz from lyocell were: a fraction obtained at an ammonium sulfate concentration 
of 1 .35 M regarding the H43 culture solution; a fraction obtained at an ammonium sulfate concentration of 1 05 M 
regarding the H45 culture solution; a fraction obtained at an ammonium sulfate concentration of 0.75 M regarding the 
H4 culture solution; and a fraction obtained at an ammonium sulfate concentration of 1 .05 M regarding the H4 deara- 
dation product. Therefore. 40 ml each of these fractions was fractionated. 

[01 1 8] An ammonium sulfate solution at a final concentration of 1 .5 M was prepared from 40 ml of the obtained active 
rations and then applied at a flow rate of 4.0 ml/min to Macro-Prep Methyl HIC Support hydrophobic chromatography 
(25 ml in gel volume, BioRad Laboratories) which had been previously equilibrated with 1.5 M ammonium sulfate 
solution. It was then fractionated by eluting at a flow rate of 4.0 ml/min In deionized water. Among these fractions 8 
ml of fraction, which was found to have strong activity of removing fuzz from lyocell, was fractionated 

10119] Acetatebuffer(150ml,50mM.pH4.0)waspreparedbydilutlngtheobtainedactlvefractlons,andtheresultlnq 
buffer was then applied at a flow rate of 2 mlAnin to MonoS 10-/1 0-HR column (Amersham Phamiacia), which had 
been previously equilibrated with 50 mM acetate buffer (pH 4.0). it was then fractionated by eluting at a flow rate of 2 
mMmin in a stepwise elution method in which the concentration of NaCI in 50 mM acetate buffer (pH 4 0) was increased 
by 0.1 M each to 1 M NaCI in 50 mM acetate buffer (pH 5.0). Fractions that were obtained at a NaCI concentration of 
about 0.2 to 0.3M were found to have activity of removing fuzz from lyocell. Therefore, 6 ml of fraction found to have 
the strongest activity was fractionated. These fractions showed in SDS-PAGE a single band of about 25 KDa regardino 
proteins punfied from the H43 culture solution, the H45 culture solution, and the H4 degradation product and a single 
band of about 40 KDa regarding the protein purTied from the H4 culture solution . 

[0120] SDS-PAGE was carried out using the system of Tefco In which an electrophoresis tank (No. 03-1 01) a source 
(Mode^: 3540) 10% gel (01-015), anda buffer kitfor SDS-PAGE (06-0301) were used. Theconditionforelectrophoresis 
was 1 8 mA/10 m n and then 20 mA/90 min. In protein detection after the electrophoresis, silver staining was carried 
out using a 2D-8ilver staining reagent IT "DAIICHI" (Daiichi Pure Chemicals Co., Ltd.) for electrophoresis A standard 
protein used as a marker was the SDS-PAGE molecular weight standard protein, Low Range (161-0304 BioRad) 
[01 21 ] The activity of removing fuzz from lyocell was measured In accordance with the following meth^ 
'°KK^^ K Colo; knitted fabric of lyocell (Toyoshima Japan) was fuzzed in a large washer together with a surfactant and 
rubber balls. Thereafter, the fuzzy knitted fabric of lyocell (Toyoshima Japan, 9 cm x 1 0 cm, about 2 g in weight) was 
^lindrically sewn and subjected to fuzz removal treatment with various enzymes under the conditions as set forth 
below. The protein concentrations required to completely remove fuzz existing in the cylindrical fabric by this treatment 

WGfO C3lCUl3t6Cj. 

[01 23] The protein concentrations of various endoglucanases were calculated from the peak area at UV 280 nm of 
respective endoglucanase eluted with a linear gradient from 0% to 80% of acetonitrile concentration In 0 05% TFA 
(tnfluoroacetic acid) at a flow rate of 1 .0 ml/min in HPLC analysis using TSK gel TMS-250 column (4.6 mm LD x 7 5 
cm, TOSOH Japan). The standard used was the purified NCE4, which was analyzed In HPLC under the same condi- 
tions, the protein concentration of which had been preliminarily measured by the protein assay kit (BioRad Laborato- 
ries). The standard used to measure the protein concentration for the protein assay kit was albumin standard (Bovine 
serum albumin, fraction V, PIERCE). The purified NCE4 (coded by nucleotides 118 to 1088 In SEQ ID NO- 18) was 
isolated and purified from a culture solution of Humicola insokns according to the method described in WO 98A)3640 
Test machine: Launder Meter L-12 (Daiei Kagaku SeikI MFG, Japan) 
Temperature: 55°C 
Time: 60 minutes 
Reaction volume: 40 ml 

Reaction pH: pH 5 (10 mM acetate buffer) 
pH 6 (1 0 mM acetate buffer) 

[0124] The treating llqukl contained 4 njbber balls (about 16 g each) together with the endoglucanase solution. 
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^1 identification of N-terminal amino acid se quences of RCE I variant protein isolatod .nH nnnUaH tro^^ 
Humtcola /nso/ens transfomiant '^om 

S *h?f?I '° ^"'^^'"t;^" ^^'d ^^q^^nces of the purified proteins obtained In Example 4 

each of the fractions was subjected to SDS-PAGEmini (Tefco), electroblotted on a PVDF membrane stained with 

on rrH "; ^'TTT'T ''''''' '"'-^^ ^^^^^^ then air dl Tpol 

J 2Ta^^^^^^^^^^ r '''''''''' ^"^j^^^^^ t^ ^ ^^<^t^- Sequencer (Modei 

492. PE Applied Biosystems) to analyze the N4emilnal amino acid sequence. The amino acid sequences we e read 

without any problem. Regarding the enzyme purified from the H43 culture solution, however, no signal was generated 
by Edman degradation and the N-terminal amino acid was found to be modified and protected. Thus enz^Jie^^^^ 

fo 30 minutes and. after a protein unbound portion on the membrane was blocked, this enzyme was treated with pfu 

Thus, sequencing was earned out once more. The obtained sequences were as shown below. 

The N-temiinal amino acid sequence of RCE I-H43: 

[0126] 



Can-Scr-GIy-iScr-Gly-Aig-Tlir (7 residues) (SEQ ID NO: 30). 

The N-tennlnal amino acid sequence of RCE I-H45: 
[0127] 

Lys-Tyr^er-Ala-VaWesr-Gly (7 zesidues) (SEQ ID NO: 31); 

and 

Ala-ValnSer-Glv-Glv-Ala-Ser a vendues) (SEQ ID NO: 32^. 

The N-terminal amino acid sequence of RCE l-H4(25KDa): 
[0128] 

Ser-Aia-W-Ser-Gly-<?Iy- Ala C7 residues) (SEQ ID NO: 33); 

and 

Gly^ly-Ala-Ser-Gly Asn-Gly (7 readues) (SEQ ED NO: 34). 

The N-temlnal amino acid sequence of RCE I-H4 (40KDa): 
[0129] 

Ala-01u-(Cys)nSa'-L>^-Leu-Tyr (7 residues) (SEQ ID NO: 35). 
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[0130] As a result of identification of N-tenninal amino acid sequences, it was found that only the enzyme purified 
from the H4 culture solution (hereinafter referred to as "RCE I-H4 (40 KDa)") had a cellulose-binding domain (CBD) 
while any of the remaining enzyme purified from the H43 culture solution (hereinafter referred to as "RCE I-H43 (25 
KDa)"), the enzyme purified from the H45 culture solution (hereinafter referred to as "RCE I-H45 (25 KDa)"), or the 
enzyme purified from the H4 degradation product (hereinafter referred to as "RCE I-H4 (25 KDa)") had no cellulose- 
binding domain (CBD) but had only the catalytic active domains (CAD). 

[Example 6] Comparison o f specific activity of removing fuzz from cotton fabric between RCE I. which lacks the 
cellulose-binding domain, and RCE L which has the cellulose-binding domain 

[0131] Using the endoglucanase which was homogenously purified in Example 5, a knitted cotton fabric (a 6 cm x 
8 cm fabric from cotton smooth knit No. 3900, Nitto Boseki Co.. Ltd.. dyed in brown by reactive dye at Tsuyatomo 
Senko), which has been fuzzed in a large washer, was subjected to fuzz removal under the following conditions. The 
amount of fuzz remaining unremoved was visually evaluated, and the amount of purified enzyme to be added so that 
the amount of remaining fuzz would reach 50% was detemriined. The amount of protein was detemilned using the BCA 
Protein Assay Reagent (PIERCE) In accordance with the conditions described in the attached manual. The estimated 
molecular weight of the 40 KDa purified RCE I protein (RCE I-H4 (40 KDa)) Is about 1 .5 times higher than those of the 
25 KDa purified RCE I proteins (RCE |.H43 (25 KDa)), RCE I-H45 (25 KDa)). and RCE I-H4 (25 KDa)). Accordingly, 
even though the quantified amounts of proteins are the same, the number of enzyme molecules in the 40 KDa purified 
RCE I protein is only about two-thirds of that In the 25 KDa purified RCE I protein In temis of the number of enzyme 
molecules contained in the protein. 

Test machine: Launder Meter L-20 (Dalel Kagaku SeikI MFG, Japan) 
Temperature: 40*'C or 55°C 
Time: 120 minutes 
Reaction volume: 40 ml 

Reaction pH: Reacted at pH 7 (1 mM phosphate buffer, prepared using deionized water). 

[0132] The treating liquid contained 4 mbber balls (about 16 g each) together with the enzyme solution. 

[0133] The results are as shown in Table 2 below. 



Table 2 





The amount of enzyme added 
40''C 


The amount of enzyme added 

ss-c 


Purified RCE I-H4 (40 KDa) 
Purified RCE I-H4 (25 KDa) 


390 ^g or more 
18 ng 


390 ^g or more 
53 fig 



[01 34] As is apparent from the results shown in Table 2, the 25 KDa protein, which lacks the cellulose-binding domain 
(CBD). exhibits much higher activity of removing fuzz from cotton fabrics than the 40 KDa protein, which has the 
cellulose-binding domain (CBD), even though both proteins are originated from the same 2ygomycetes-derived en- 
doglucanases RCE I. 



[Example 7] Comparison of specific activit y of removing fuzz from lyoceii fabric between RCE I. which lacks the 
cellulose-binding domain, and RCE I. which has the cellulose-binding domain ' " 

[0135] Using the endoglucanase which was homogenously purified in Example 6, a lyocell fabric (6 cm x 8 cm 
Toyoshima Japan) was subjected to fuzz removal under conditions improved from the method described in Example 
4. The amount of fuzz remaining unremoved was visually evaluated, and the amount of purified enzyme required to 
completely remove fuzz was detemfilned. The amount of protein was determined using the BCA Protein Assay Reagent 
(PIERCE) in accordance with the conditions described in the attached manual. The estimated molecular weight of the 
40 KDa purified RCE I protein (RCE I-H4 (40 KDa)) is about 1 .5 times higher than those of the 25 KDa purified RCE 
I proteins (RCE I-H43 (25 KDa)), RCE I-H45 (25 KDa)). and RCE I-H4 (25 KDa)). Accordingly, even though the quan- 
lifted amounts of proteins are the same, the number of enzyme molecules in the 40 KDa purified RCE I protein is only 
about two-thirds of that in the 25 KDa purified RCE I protein in temis of the number of enzyme molecules contained 
in the protein. Test machine; Launder Meter L-20 (Dalel Kagaku Seiki MFG, Japan) 
Temperature: 40** C 
Time: 90 minutes 
Reaction volume: 50 ml 

Reaction pH: Reacted at pH 6 (10 mM acetate buffer, prepared using deionized water). 
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[0136] The treating liquid contained 4 rubber balls (about 16 g each) together with the enzyme solution. 
[0137] The results are as shown In Table 3 below. 



Tables 





The amount of enzyme added 


Purified RCE I-H4 (40 KDa) 
Purified RCE I-H4 (25 KDa) 


32 Jig 
11^9 



10 [0138] As is apparent from the results shown in Table 3, RCE I, which is Zygomycetes-derived endoglucanase, 
exhibits a higher specific activity of removing fuzz from lyocell fabrics as the 25 KDa protein, which lacks the cellulose- 
binding domain (CBD), than as the 40 KDa protein, which has the cellulose-binding domain (CBD). 

[Example 8] Comparison of specific activity of removing fuzz from cotton fabric between RCE I, which lacks the 
f5 cellulose-binding domain, and RCE I. which has the cellulose-binding domain, under alkaline, low-temperature, and 
surfactant-present conditions 

[0139] Using the endoglucanase which was homogenously purified in Example 5, a knitted cotton fabric (a 6 cm x 
8 cm fabric from cotton smooth knit No. 3900, Nitto Boseki Co., Ltd., dyed in brown by reactive dye at Tsuyatomo 

20 Senko), which had been fuzzed in a large washer, was subjected to fuzz removal under the following conditions. The 
amount of fuzz remaining unremoved was visually evaluated, and the amount of purified enzyme to be added so that 
the amount of remaining fuzz would reach 50% was detennlned. The amount of protein was detennlned using the BCA 
Protein Assay Reagent (PIERCE) in accordance with the conditions described in the attached manual. The estimated 
molecular weight of the 40 KDa purified RCE I protein (RCE I-H4 (40 KDa)) is about 1 .5 times higher than those of the 

25 25 KDa purified RCE I proteins (RCE I-H43 (25 KDa)). RCE I-H45 (25 KDa)). and RCE I-H4 (25 KDa)). Accordingly, 
even though the quantified amounts of proteins are the same, the number of enzyme molecules in the 40 KDa purified 
RCE I protein is only about two-thirds of that in the 25 KDa purified RCE I protein in temis of the number of enzyme 
molecules contained in the protein. 

Test machine: Launder Meter L-20 (Daiel Kagaku Seiki MFG, Japan) 
30 Temperature: 30*C 
Time: 120 minutes 
Reaction volume: 40 ml 

Reaction pH: Reacted at pH 10 (5 mM sodium carbonate buffer, prepared using deionized water). 
[0140] The treating liquid contained a nonionic surfactant Persoft NK-100 (NOF Corp.) at a final concentration of 
35 1 00 ppm together with the enzyme solution and 4 rubber balls (about 1 6 g each). 
[0141] The results are as shown in Table 4 below. 



Table 4 





The amount of enzyme added 


Purified RCE I-H4 (40 KDa) 
Purified RCE I-H4 (25 KDa) 


390 Jig or more 
52 Jig 



[0142] As is apparent from the results shown in Table 4, given the low-temperature, alkaline, and surfactant-present 
45 conditions under which detergents are actually used, the 25 KDa protein, which lacks the cellulose-binding domain 
(CBD), exhibits much higher activity of removing fuzz from cotton fabrics than the 40 KDa protein, which has the 
cellulose-binding domain (CBD), even though both proteins are originated from the same Zygomycetes-derived en- 
doglucanases RCE 1. 

50 [Example 9] Comparison of specific activity of removing fuzz from cotton fabric among various purified RCE I proteins, 
which lack the cellulose-binding domains 

[0143] Using the endoglucanase which was homogenously purified in Example 5, a knitted cotton fabric (a 6 cm x 
8 cm fabric from cotton smooth knit No. 3900, Nitto Boseki Co., Ltd., dyed in brown by reactive dye at Tsuyatomo 
55 Senko), which had been fuzzed in a large washer, was subjected to fuzz removal under the following conditions. The 
amount of fuzz remaining unremoved was visually evaluated, and the amount of purified enzyme to be added so that 
the amount of remaining fuzz would reach 50% was detennined. The amount of protein was detemiined using the BCA 
Protein Assay Reagent (PIERCE) in accordance with the conditions described in the attached manual. The estimated 
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S°L'^fnuZ?l,?p f ''T'' ? ' ^'^^^ ^ 5 times higher than those of the 

25 KDa punfied RCE I proteins (RCE I-H43 (25 KDa)), RCE I-H45 (25 KDa)), and RCE I-H4 (25 KDa)) Accordinalv 

^n hough thequantified amounts Of proteins are the same.the number Of Lyme mol^^^^^^^^ 

srnrai2rprter°'*'"'"'^^"^°^^"'^"^^ 

Test machine: Launch Meter L-20 (Dale! Kagaku Seiki MFG, Japan) 
Temperature: 40^0 
Time: 120 minutes 
Reaction volume: 40 ml 

Reaction pH: Reacted at pH 7 (1 mM phosphate buffer, prepared using delonlzed water) 

ni^cl 11^ ''"^^i"^ ""^"'"^ "^"^"'""^ "^^^^^ 1« 9 together with the enzyme solution 

[0145] The results are as shown in Table 5 below. 



Table 5 





The amount of enzyme added 


Purified RCE I-H4 (40 KDa) 
Purified RCE I-H43 (25 KDa) 
Purified RCE I-H45 (25 KDa) 
Purified RCE I-H4 (25 KDa) 


390 ng or more 
26 fig 
18 fig 
18ttg 



u . J ^^^^'^"^ '^"^ '^^^^ 5, various RCE I proteins such as RCE I-H43 (25 KDa) RCE 

l-H-45 (25 KDa). and RCE I-H4 (25 KDa). which lacK the celiulose-binding domains, exhibrtm ch higher aSlSr o^ 

IS r ? ^1 ''''' ' '^"^'^ '"^ cellulose-binding domain (CBD), exhibited hiZ 

itrirZr ^rrrr '^^"^ '° P™'^'"- ^^'^^'^ cellulose-Ling domrcCBoT 

regardless the length of the Imkerdomain remaining on the 

e hibrt IThSr ".Tr'*"'" f ' P^'^'"- ""^"^^ "^^^ cellulose-binding domain (CBD) 

s^t^iiroircii^^^^^^^ 
iacVt°J ecTotr^^^^^^ 



10147] Using the endoglucanase which was homogenous^ purified In Example 6, a lyocell fabric (6 cm x 8 cm 

4. The amount of fuzz remaining unremoved was visually evaluated, and the amount of purified enzyme requSo 
completeyremovefuzz 

in^n ^ '"«Ta^lT "''^"^^ i" attached manual. The estimated molecular St of the 

40 KDa PU"fled RCE I protein (RCE 1-84 (40 KDa)) is about 1 .5 times higher than those of the 25 KDa pSed RCE 
proterns (RCE -H43 (25 KDa)), RCE I-H45 (25 KDa)), and RCE I-H4 (25 KDa)). Accordingly even^ughtfal^^^ 
t,f.ed amounts of pr<«elns are thesame,the number Of enzyme molecules In the40KDapuri^^^^ 

'''^'"'''''^^''''''^^^^^^ 

Test machine: Launder Meter L-20 (Daiei Kagaku Selkl MFG, Japan) 
Temperature: 40°C 
Time: 90 minutes 
Reaction volume: 40 ml 

Reaction pH: Reacted at pH 6 (1 0 mM acetate buffer, prepared using delonlzed water) 

S JS III rlt'i^^ contained 4 rubber balls (about 16 g each) together with the enzyme solution. 

[0149] The results are as shown in Table 6 below. 



Table 6 





The amount of enzyme added 


Purified RCE I-H4 (40 KDa) 
Purified RCE I-H43 (25 KDa) 


32 ng 

12 ^g 
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Tables (continued) 



The amount of enzyme added 



5 



Purified RCE I-H45 (25 KDa) 
Purified RCE I-H4 (25 KDa) 



11 ^ig 

11 ng 



[0150] As Is apparent from the results shown in Table 6, various RCE I proteins such as RCE I-H43 (25 KDa), RCE 
l-H-45 (25 KDa), and RCE I-H4 (25 KDa), which lacl<the cellulose-binding domains, exhibit higher activity of removing 
fuzz from lyoceil fabrics than the 40 KDa protein RCE I-H4 (40 KDa). which has the celiulose-binding domain (CBD). 
This indicates that the RCE i protein, which lacks the celiulose-binding domain (CBD), exhibits higher activity of re- 
moving fuzz from lyoceil fabrics than the 40 KDa protein, which has the cellulose-binding domain (CBD) regardless 
the length of the linker domain remaining on the N-terminal side of the catalytic active domain. Also, whether or not it 
is an artificial or non-artificial deficiency, the RCE I protein, which lacks the cellulose-binding domain (CBD), exhibits 
higher activity of removing fuzz from lyoceil fabrics than the 40 KDa protein RCE I-H4 (40 KDa), whteh has the cellulose- 
binding domain (CBD). 

[0151] All publications, patents, and patent applications cited herein are incorporated herein by reference in their 
entirety. 

Industrial Applicability 

[0152] When Zygomycetes-derived endoglucanase such as RCE I, RCE II, RCE III, MCE I, MCE II, or PCE I, which 
lacks the cellulose-binding domain, is allowed to act, the endoglucanase activity can be significantly enhanced in com- 
parison with endoglucanase having a cellulose-binding domain. Accordingly, fabric treatment such as reduction of 
fuzzing, improvement In feel and appearance, color clarification, partial color change, and softening of cellulose-con- 
taining fabrics and Improvement in deinking of waste paper and drainage of paper pulp can be effected with a smaller 
amount of enzymes. This can decrease necessary costs remari^ably 

Free Text of Sequence Listings 



SEQ ID NO: 13: codon optimized sequence corresponding to RCE i protein (SEQ ID NO: 2) 

SEQ ID NO: 1 7: consensus amino acid sequence found in the cellulose-binding domain of family 46 endoglucanase 

SEQ ID NO: 20 to 23: primers 

SEQ ID NO: 24: recombinant protein 

SEQ ID NO: 25 to 28: primers 

SEQ ID NO: 29: recombinant protein 

SEQ ID NO: 30 to 35: N-tenmlnal amino acid sequence of recombinant protein 

SEQ ID NO: 36 and 37: primers 

SEQ ID NO: 38: NCE5 amino acid sequence 

SEQ ID NO: 39: NCE5 cDNA sequence 

SEQ ID NO: 40 to 43: primers 



30 



[0153] 



45 



50 



55 



Patent provided by Sughru^ion, PLLC - http://www.sughrue.com 



EP 1344 820 A1 



SEQUENCE LISTING 
<110> MEIJI SEIKA KAISHA. LTD. 

<]20> EodoglucaDase Enzyme from Zygonycetes Having Deletion of 
Cellulose Binding Domain 

<130> PH-1443-PCT 

<1S0> JP2D00-354296 
<151> 2000-11-21 

<160> 43 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 338 
<212> PRT 

<213> Rhizopus oryzae CP96001 
<220> 

<221> siglpeptide 
<222> (-23)... (-1) 

<220> 

<221> matlpeptide 
<222> (D... (315) 
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<400> 1 

Met Lys Phe lie Thr He Ala Ser Ser Ala Leu Leu Ala Leu Ala Leu 

-20 -15 -JO 

Gly Thr Glu Met Ala Ser Ala Ala GIu Cys Ser Lys Leu Tyr Gly Gin 

-5 1 5 

Cys Gly Gly Lys Asn Trp Asd Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 15 20 25 

Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Ser Gly 

30 35 40 

Ser Ser Gly Asn Lys Ser Ser Glu Ser Ala His Lys Lys Thr Thr Thr 

45 50 55 

Ala Ala His Lys Lys Thr Thr Thr Ala Ala His Lys Lys Thr Thr Thr 

60 65 70 

Ala Pro Ala Lys Lys Thr Thr Thr Val Ala Lys Ala Ser Thr Pro Ser 

75 80 85 

Asn Ser Ser Ser Ser Ser Ser Gly Lys Tyr Ser Ala Val Ser Gly Gly 
90 95 100 105 

Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala 

110 115 120 

Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser Pro Val Lys Ser 

125 130 135 

Cys Asn Lys Asp Gly Val Thr Ala Leu Ser Asp Ser Asn Ala Gin Ser 

140 145 ISO 

Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp Asn Gin Pro Trp 

155 160 165 

Ala Val Asn Asp Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala He Ser 
no 175 180 185 

Gly Gly Gly Glu Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe 
190 195 200 
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Thr Ser TJir Ser Val Ala Gly Lys Lys Met Val Val Gin Yal Thr Asb 

205 210 215 

Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gin 

220 225 230 

Met Pro Gly Gly Gly Val Gly He Phe Asn Gly Cys Ser Ser Gin Trp 

235 240 245 

Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly Gly He Ser Ser 
250 255 260 265 

Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys Lys 

270 275 280 

Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Aso Pro Ser Met Thr Tyr 

285 290 295 

Lys Glu Val Thr Cys Pro Lys Glu He Thr Ala Lys Thr Gly Cys Ser 
300 305 310 

Arg Lys 
315 

<210> 2 
<211> 1017 
<212> DNA 

<213> Rhizopus oryzae CP96001 
<220> 

<221> siglpeplide 
<Z22> (D... (69) 

<220> 

<221> inalffpeptlde 
<222> (70)... 11017) 
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<400> 2 

atg aag ttt att act att gcc tct tec get etc ttg get etc gee etc 4i 
Met Lys Phe He Thr He Ala Ser Ser Ala leu Leu Ala Leu Ala Leu 

-20 -15 >10 

ggt act gaa atg gcc tct get get gaa tgt age aaa ttg tat ggt caa 96 
Gly Thr Glu Met Ala Ser Ala Ala Glu Cys Ser Lys Leu Tyr Gly Cln 

-5 1 5 

tgt ggt ggt aag aac tgg aat ggc cct act tgt tgt gaa tct gga tec 144 
Cys Gly Gly Lys Asn Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 15 20 25 

acc tgt aaa gta age aac gat tac tac tct caa tgt ctt ccc tct gga 192 
Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Ser Gly 

30 35 40 

age agt ggc aat aaa tct tct gaa agt get cac aag aag act acc aet 240 
Ser Ser Gly Asa Lys Ser Ser Glu Ser Ala His Lys Lys Thr Thr Thr 

45 50 55 

get get cac aag aag act act acc get get cat aaa aag act ace act 288 
Ala Ala His Lys Lys Thr Thr Thr Ala Ala His Lys Lys Thr Thr Thr 

60 65 70 

get cct get aag aag act aca act gtt gcc aaa get tee, ace cct tct 336 
Ala Pro Ala Lys Lys Thr Thr Thr Val Ala Lys Ala Ser Thr Pro Ser 

75 80 85 

aac tct age tct age tec age ggc aaa tat tec get gtc tct ggt ggt 384 
Asn Ser Ser Ser Ser Ser Ser Gly Lys Tyr Ser Ala Val Ser Gly Gly 
90 95 100 

fee tct ggt aac ggt gtc act aet cgt tat tgg gat tgc tgt aag gcc 432 
Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala 
no 115 
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tec tgl age 
Ser Cys Ser 

tgt aac aaa 
Cys Asn Lys 
140 

ggc tgt aac 
Gly Cys Asn 
155 

get gta aac 
Ala Val Asn 
170 

ggt ggt ggt 
Gly Gly Gly 



tgg ccc ggt aag 
Trp Pro Gly Lys 
125 

gat ggt gtc act 
Asp Gly Val Thr 



ggt ggt 
Gly Gly 

gac aac 
Asp Asn 



act tot acc 
Thr Ser Tfcr 

act ggt ggt 
Thr Gly Gly 
220 

atg ccc ggt 
Met Pro Gly 

235 
ggt get ccc 
Gly Ala Pro 
250 

gca tct gac 
Ala Ser Asp 



gaa tct 
Glu Ser 
190 
tct gtt 
Ser Yal 
205 

gat ctt 
Asp Leu 



aac agt 
Asn Ser 
160 
ctt gee 
Leu Ala 
175 

cgc tgg 
Arg Trp 

get ggt 
Ala Gly 

ggc tec 
Gly Ser 



ggt ggt gtt ggt 
Gly Gly Yal Gly 
240 

aat gac ggt tgg 
Asn Asp Gly Trp 
255 

tgc let agt ctt 
Cys Ser Ser Leu 



gee aat gtc agt tct cct gtc aag tec 480 
Ala AsD Val Ser Ser Pro Val lys Ser 

130 135 
gcc ctt agt gac age aat gcc caa agt 528 
Ala Leu Ser Asp Ser Asn Ala Gin Ser 
145 150 
tac atg tgt aac gac aac caa cct tgg 576 
Tyr Met Cys Asn Asp Asn Gin Pro Trp 
165 

ttc get get get gcc ate agt 624 
Phe Ala Ala Ala Ala He Ser 
180 185 
tct tgt ttc gaa ctt act ttc 672 
Ser Cys Phe Glu Leu Thr Phe 
195 200 
atg gtt gtc caa gtc act aac 720 
Met Yal Val Gin Val Thr Asn 
215 

ggt get cac ttt gac ttg caa 768 

Gly Ala His Phe Asp Leu Gin 
230 

aat ggt tgt tec age caa tgg 816 
Asn Gly Cys Ser Ser Gin Trp 
245 

aga tac ggt ggt att tct tct 864 
Arg Tyr Gly Gly He Ser Ser 
260 265 
gca etc caa get ggt tgt aaa 912 
Ala Leu Gin Ala Gly Cys Lys 



tat ggt 
Tyr Gly 

tgc tgt 
Cys Cys 

aag aag 
Lys Lys 
210 
tct act 
Ser Thr 
225 

att ttc 
He Phe 

ggc tea 
Gly Ser 

cct tec 
Pro Ser 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



270 275 280 

tgg aga Itc aac tgg ttc aag aac get gat aac cca age atg act tac 960 
Trp Arg Phe Asd Trp Phe lys Asn Ala Asp Asn Pro Ser Met Thr Tyr 

285 290 295 

aag gaa gtt acc tgt cct aag gaa ate aee gcc aag aca ggt tgt tea 1008 
Lys Glu Val Thr Cys Pro Lys GIu He Thr Ala Lys Thr Gly Cys Ser 

300 305 310 

aga aaa taa jgj^ 
Arg Lys 
315 

<210> 3 
<211> 366 
<2I2> PRT 

<213> Rhizopus oryzae CP96001 
<220> 

<221> sig4)peptide 
<222> (-23)... (-1) 

<220> 

<221> mattpeptide 
<222> (D... (343) 

<400> 3 

Met Lys Phe He Thr He Thr Ser Ser Ala Leu Leu Ala Leu Ala Leu 

-20 -15 -10 

Gly Thr Glu Met Ala Ser Ala Ala Lys Cys Ser Lys Leu Tyr Gly Gin 
-5 1 5 
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Cys Gly Gly lys Asp Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 15 20 25 

Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Ala Pro Glu 

30 35 40 

Ser Asn Gly Asn Lys Ser Ser Glu Cys Ser Lys Leu Tyr Gly Gin Cys 

« 50 55 

Gly Gly. lys Asp Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser Thr 

60 65 70 

Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Ala Pro Glu Ser 

7S 80 85 

Asn Gly Asn Lys Thr Ser Glu Ser Ala His Lys Thr Thr Thr Thr Thr 
90 95 100 105 

Ala Pro Ala Lys Glu lie Thr Thr Thr Ala Lys Ala Ser Asn Ser Ser 

110 115 120 

Asn Ser Ser Gly Lys Tyr Ser He Val Ser Gly Gly Ala Ser Gly Asn 

125 130 135 

Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala Ser Cys Ser Trp 

140 145 150 

Pro Gly lys Ala Asn Val Ser Ser Pro Val lys Ser Cys Asn Lys Asp 

155 160 165 

Gly Val Thr Ala leu Ser Asp Ser Asn Val Gin Ser Gly Cys Asn Gly 
170 175 180 185 

Gly Asn Ser Tyr Net Cys Asn Asp Asn Gin Pro Trp Ala Val Asn Asp 

190 195 200 

Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala He Ser Gly Gly Gly Glu 

205 210 215 

Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe Thr Ser Thr Ser 

220 225 230 

Val Ala Gly Lys Lys Met Val He Gin Val Thr Asn Thr Gly Gly Asp 
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235 240 245 

Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gin Met Pro Gly Gly 
250 255 260 265 

Gly Val Gly He Phe Asn Gly Cys Ser Lys Gin Trp Gly Ala Pro Asn 

270 275 280 

Asp Gly Trp Gly Ser Arg Tyr Gly Gly He Ser Ser Ala Ser Asp Cys 

285 290 295 

Ser Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys Lys Trp Arg Phe Asn 

300 305 310 

Trp Phe lys Asn Ala Asp Asn Pro Ser Met Thr Tyr Lys Glu Yal Tbr 

315 320 325 

Cys Pro Lys Glu He Thr Ala Lys Thr Gly Cys Ser Arg lys 
330 335 340 

<210> 4 
<211> 1101 
<212> DNA 

<213> Rhizopus oryzae CP96001 
<220> 

<221> sigtpeptide 
<222> {!)... (69) 

<220> 

<221> nattpeptide 
<222> (70)... (1101) 

<400> 4 

atg aag ttt att act att acc tct tec get etc ttg get etc gee ctt 
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Mel lys Phe He Thr He Thr Ser Ser Ala Leu Leu Ala Leu Ala Leu 

-20 -15 -10 

ggl act gaa atg gcc tct get get aaa tgt age aag ctg tat ggt caa 96 
Gly Thr Glu Met Ala Ser Ala Ala Lys Cys Ser Lys Leu Tyr Gly Gin 

-5 1 5 

tgt ggt ggt aag gac Igg aaf ggc cct act tgt tgc gaa tct gga tec 144 
Cys Gly Gly Lys Asp Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 15 20 25 

acc tgt aaa gta age aac gat tac tac tct caa tgt ett gcc cct gaa 192 
Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Ala Pro Glu 

30 35 40 

age aac ggc aat aag tct tct gaa tgt age aag ttg tat ggt caa tgt 240 
Ser Asn Gly Asn Lys Ser Ser Glu Cys Ser Lys Leu Tyr Gly Gin Cys 

45 50 55 

ggt ggt aag gac tgg aat ggc cct act tgt tgc gaa tct gga tec acc 288 
Gly Gly Lys Asp Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser Thr 

60 65 70 

tgt aaa gta age aac gat tac tac let caa tgt ett gcc cct gaa age 336 
Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Ala Pro Glu Ser 

75 80 85 

aat ggc aat aaa act let gaa age get cat aaa acg act act acc act 384 
Asn Gly Asn Lys Thr Ser Glu Ser Ala His Lys Thr Thr Thr Thr Thr 
90 95 100 105 

get ccc get aag gaa alt aea act act gcc aaa get tea aac let tct 432 
Ala Pro Ala Lys Glu He Thr Thr Thr Ala Lys Ala Ser Asn Ser Ser 

110 115 120 

aac tct age ggc aaa tac tec att gtc tct ggt ggt gcc tct ggt aac 480 
Asn Ser Ser Gly Lys Tyr Ser He Val Ser Gly Cly Ala Ser Gly Asn 
125 130 135 
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ggt gtc act act cgt tat tgg gat tgc tgt aag gee tec tgt age tgg 528 
Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala Ser Cys Ser Trp 

140 145 150 

ccc ggt aag gee aat glc agt tct cot gtc aag tec tgt aac aaa gat 576 
Pro Gly Lys Ala Asn Val Ser Ser Pro Val Lys Ser Cys Asn Lys Asp 

155 160 165 

ggt gtc act gcc ctt agt gac age aat gtc caa agt ggc tgt aac ggt 624 
Gly Val Thr Ala Leu Ser Asp Ser Asn Val Gin Ser Gly Cys Asn Gly 
170 175 180 185 

ggt aac agt tac atg tgt aac gac aac cag cct tgg get gta aac gat 672 
Gly Asn Ser Tyr Net Cys Asn Asp Asn Gin Pro Trp Ala Val Asn Asp 

190 195 200 

aat ctt gcc tat ggt ttc get get get gcc ate agt ggt ggt ggt gaa 720 
Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala He Ser Gly Gly Gly Glu 

205 210 215 

tct cgc tgg tgc tgt tct tgt ttc gaa ctt act ttc act tct acc tct 768 
Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe Thr Ser Thr Ser 

220 22S 230 

gtt get ggt aag aag atg gtt ate caa gtc act aac act ggt ggt gat 816 
Val Ala Gly Lys Lys Met Val He Gin Val Thr Asn Thr Gly Gly Asp 

235 240 245 

ctt ggc tec tct act ggt get cac ttt gac ttg caa atg ccc ggt ggt 864 
Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gin Met Pro Gly Gly 
250 255 260 265 

ggt gtt ggt att ttc aat ggt tgc tec aag caa tgg ggt get ccc aat 912 
Gly Val Gly He Phe Asn Gly Cys Ser Lys Gin Trp Gly Ala Pro Asn 

270 275 280 

gac ggt tgg ggc teg aga tac ggt ggt att tct tct gca tct gac tgc 950 
Asp Gly Trp Gly Ser Arg Tyr Gly Gly He Ser Ser Ala Ser Asp Cys 
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10 



IS 



20 



30 



35 



40 



50 



285 290 295 

tct agt ctt ccl tec gca etc caa get ggt tgt aaa tgg aga Itc aac 1008 
Ser Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys Lys Trp Arg Phe Asn 

300 305 310 

tgg ttc aag aac get gat aac cca age atg act tac aag gaa gtt acc 1056 
Trp Phe Lys Asn Ala Asp Asn Pro Ser Met Thr Tyr Lys Glu Val Thi 

315 320 325 

tgt ccc aag gaa ate acc gcc aag aca ggt tgt tea aga aaa taa noi 
Cys Pro Lys Glu lie Thr Ala Lys Thr Gly Cys Ser Arg Lys 
330 335 340 

<210> 5 
<211> 360 
<212> P8T 

<213> Rhizopus oryzae CP96001 
<220> 

<221> sig#peptide 
<222> (-23)... (-1) 

<220> 

<221> nattpeptide 
<222> (1)... (337) 

<400> 5 

Met Lys Phe Leu Thr He Ala Ser Ser Ala He Leu Ala Leu Ala Yal 

-20 -15 -10 

Gly Thr Glu Met Ala His Ala Ala Glu Cys Ser Lys Ala Tyr Tyr Gin 
-5 1 5 • 
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Cys Gly Gly Lys Asn Trp Asp Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 IS 20 25 

Thr Cys Val Asp Tyr Pro Asp Asn Pro Phe Tyr Ser Gin Cys Val Pro 

30 35 40 

Asn Glu Asn Leu Thr Ser Thr Asn Lys Ser Ser His Lys Thr Thr Thr 

« 50 55 

Thr Glu Ser Ala Lys Lys Thr Thr Thr Thr Lys Gly Ser Lys Lys Thr 

60 65 70 

Thr Thr Thr Glu Ala Ser Lys Lys Thr Thr Thr Thr Glu Ala Ser Lys 

75 80 85 

Lys Thr Thr Thr Thr Glu Ala Ser Lys Lys Thr Thr Thr Thr Thr Lys 
90 95 100 

Lys Ala Ser Thr Ser Thr Ser Ser Ser Ser Ser Ser Ala Ser Thr Asn 

110 115 120 

Tyr Ser Ala Val Ser Gly Gly Ala Ser Gly Asn Gly Glu Thr Thr Arg 

125 130 
Tyr Trp Asp Cys Cys Lys Fro Ser Cys Ser Trp Pro Gly Lys Ala Asp 

140 145 150 

Val Thr Ser Pro Val Gly Ser Cys Asn Lys Asp Gly Lys Thr Leu Ala 

155 160 165 

Asp Asn Asn Thr Gin Asn Gly Cys Val Gly Gly Ser Ser Tyr Thr Cys 
170 175 180 ,85 

Asn Asp Asn Gin Pro Trp Val Val Ser Asp Asp Leu Ala Tyr Gly Phe 

190 195 200 

Ala Ala Ala Ser lie Ser Gly Gly Ser Glu Ala Thr Trp Cys Cys Ala 

205 210 215 

Cys Phe Glu Leu Thr Phe Thr Ser Thr Ala Val Lys Gly Lys Lys Met 

220 225 230 

Val Val Gin Val Thr Asn Thr Gly Ser Asp Leu Gly Ser Asn Thr Gly 
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235 240 245 

Ala His Phe Asp Leu Gin Met Pro Gly Gly Gly Val Gly lie Tyr Asb 
250 255 260 265 

Gly Cys Ala Thr Gin Trp Gly Ala Pro Thr Asp Gly Trp Gly Ala Arg 

270 275 280 

Tyr Gly Gly Val Ser Ser Ala Ser Asp Cys Ser Asa Leu Pro Ser Ala 

285 290 295 

Leu Gin Ala Gly Cys Lys Trp Arg Phe Gly Trp Phe Lys Asn Ala Asp 

300 305 310 

Asn Pro Thr Met Thr Tyr Lys Gin Val Thr Cys Pro Lys Ala He Thr 

315 320 325 

Ala Lys Ser Gly Cys Ser Arg Lys 
330 335 

<210> 6 
<21l> 1083 
<212> DNA 

<213> Rhizopus oryzae CP96001 
<220> 

<221> siglpeptide 
<222> (D... (69) 

<220> 

<22l> Dat»peptide 
<222> (70)... (1083) 

<400> 6 

atg aag tic cU acc alt gcc Icc tec get ate itg gca ctt gcc gtc 
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Met Lys Phe Leu Thr He Ala Ser Ser Ala He Leu Ala Leu Ala Val 

-20 -15 -10 

ggt act gaa atg gcc cat get get gaa tgt age aag get tac tac caa 96 
Gly Thr Glu Met Ala His Ala Ala GIu Cys Ser Lys Ala Tyr lyr Gin 

-5 1 5 

tgt ggt ggt aag aac tgg gat gga cct acc tgc tgt gaa tct ggc tct 144 
Cys Gly Gly Lys Asn Trp Asp Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 15 20 25 

act tgc gtt gat tat cct gac aat cct ttc tac tec caa tgt gtt ccc 192 
Thr Cys Val Asp Tyr Pro Asp Asn Pro Phe Tyr Ser Gin Cys Val Pro 

30 35 40 

aat gaa aae etc acc tee act aac aaa tct tct cac aaa ace acc act 240 
Asn Glu Asn Leu Thr Ser Thr Asn Lys Ser Ser His Lys Thr Thr Thr 

45 50 55 

act gag agt gcc aag aag act acc act act aaa ggt tec aag aag acc 288 
Thr Glu Ser Ala Lys Lys Thr Thr Thr Thr Lys Gly Ser Lys Lys Thr 

60 65 70 

acc act act gaa gcc tct aag aag acc acc act act gaa got tec aag 336 
Thr Thr Thr Glu Ala Ser Lys Lys Thr Thr Thr Thr Glu Ala Ser Lys 

75 80 85 

aag acc acc act act gaa gcc tct aag aag acc acc act act act aag 384 
Lys Thr Thr Thr Thr Glu Ala Ser Lys Lys Thr Thr Thr Thr Thr Lys 
90 95 100 105 

aag get tct acc tec act tec tct tec tct tct tct get tct aca aac 432 
Lys Ala Ser Thr Ser Thr Ser Ser Ser Ser Ser Ser Ala Ser Thr Asn 

110 115 120 

tac tec get gtc tct ggt ggt gcc tec ggt aat ggt gaa acc act cgc 480 
Tyr Ser Ala Val Ser Cly Gly Ala Ser Gly Asn Gly Glu Thr Thr Arg 
125 130 135 
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tac tgg gat tgt tgt aag cct tct tgc agt tgg ccc ggt aag get gat 528 
Tyr Trp Asp Cys Cys Lys Pro Ser Cys Ser Trp Pro Gly Lys AJa Asp 

MO 145 150 

gtc acc tec cct gtt ggc tec tgt aac aag gat ggt aag act ctt got 576 
Val Tlir Ser Pro Val Gly Ser Cys Asn Lys Asp Gly Lys Ihr Leu Ala 

155 160 165 

gat aac aac act caa aac ggc tgt gtt ggt ggt age age tac acc tgt 624 
Asp Asn ASD Thr Gin Asn Gly Cys Val Gly Gly Ser Ser Tyr Thr Cys 
iro 175 180 

aat gac aat caa cet tgg gtt gtt age gac gac ctt gee tac ggt ttc 672 
Asn Asp Asn Gin Pro Trp Val Val Ser Asp Asp Leu Ala Tyr Gly Phe 

190 195 200 

gcc get get tec att tct ggt ggt age gaa get act tgg tgt tgt gcc 720 
Ala Ala Ala Ser He Ser Gly Gly Ser Glu Ala Thr Trp Cys Cys Ala 

205 210 215 

tgt ttc gaa etc aca ttc acc tct act gcc gtc aag ggt aag aag atg • 768 
Cys Phe Glu Leu Thr Phe Thr Ser Thr Ala Val Lys Gly Lys Lys Met 

220 225 230 

gtt gtt caa gta acc aac act ggt tct gac ctt ggc tct aac act ggt 816 
Val Val Gin Val Thr Asn Thr Gly Ser Asp Leu Gly Ser Asn Thr Gly 

235 240 245 

get cac ttt gac ttg caa atg ccc ggt ggt ggt gtt ggt ate tac aat 864 
Ala His Phe Asp Leu Gin Met Pro Gly Gly Gly Val Gly He Tyr Asn 
250 255 260 265 

ggt tgt gcc act caa tgg ggt get ccc acc gat ggt tgg ggt gca aga 912 
Gly Cys Ala Thr Gin Trp Gly Ala Pro Thr Asp Gly Trp Gly Ala Arg 

270 275 280 

tac ggc ggt gtt tct tct gcc tct gac tgt tct aac ctt cct tct gcc 960 
Tyr Gly Gly Val Ser Ser Ala Ser Asp Cys Ser Asn Leu Pro Ser Ala 
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285 290 
^ ctt caa get ggt tgt aag tgg aga ttc ggc tgg ttc aaa aac get gat 

leu Gin Ala 61y Cys tys Trp Arg Phe Gly Trp Phe Lys Asn Ala Asp 
300 305 
,0 aac cca acc atg acc tac aaa caa gtt acc tgt ccc aag get ate act 
Asn Pro Thr Met Thr lyr Lys Glo Val Thr Cys Pro Lys Ala lie Thr 

315 320 325 

gcc aag tct ggc tgt tea aga aaa taa 
Ala Lys Ser Gly Cys Ser Arg Lys . 
330 335 



15 



20 



25 



30 



<210> 7 
<211> 338 
<212> PRT 

<213> Mucor circinelloides CP9900I 



<220> 

<221> siglpeptide 
^ <222> (-22)... (-1) 



1008 



1056 



1083 



<220> 

<22I> !iiat«peplide 
<222> 0)... (316) 

<400> 7 

Met Lys Phe Thr Val Ala He Thr Ser lie Ala Val Ala Leu Ala Leu 

-20 -15 _io 

Ser Ser Ser Ala Clu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gin Cys 
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Ciy Gly He Gly Trp Ser Gly Pro Thr Cys Cys Glu Ser Gly Ser Ihr 

15 20 25 

Cys Val Ala Gin Glu Gly Asn Lys Tyr Tyr Ser Gin Cys Leu Pro Gly 

30 35 40 

Ser His Ser Asn Asn Ala Gly Asn Ala Ser Ser Thr Lys Lys Thr Ser 

45 50 55 

Thr Lys Thr Ser Thr Thr Thr Ala Lys Ala Thr Ala Thr Val Thr Thr 

60 65 70 

Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Thr Thr Lys Thr Ser Thr 
75 80 85 90 

Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Gly Tyr Lys Val 

95 100 105 

He Ser Gly Gly Lys Ser Gly Ser Gly Ser Thr Thr Arg Tyr Trp Asp 

110 lis 120 

Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Val Thr Gly 

125 130 135 

Pro Val Asp Thr Cys Ala Ser Asn Gly lie Ser Leu Leu Asp Ala Asn 

140 145 150 

Ala Gin Ser Gly Cys Asn Gly Gly Asn Gly Phe Mel Cys Asn Asn Asn 
155 160 165 170 

Gin Pro Trp Ala Val Asn Asp Glu Leu Ala Tyr Gly Phe Ala Ala Ala 

175 180 185 

Ser lie Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Gly Cys Tyr Glu 

190 195 200 

Leu Thr Phe Thr Ser Gly Ala Ala Ser Gly Lys Lys Mel Val Val Gin 

205 210 215 

Val Thr Asn Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp Leu Gin 

220 225 230 

Met Pro Gly Gly Gly Val Gly He Phe Asn Gly Cys Ala Ala Gin Trp 
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235 240 245 250 

Gly Ala Pro Asn Asp Gly Trp Gly Ala Arg Tyr GJy Gly Val Ser Ser 

255 260 265 

Val Ser Asp Cys Ala Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys Lys 

270 275 280 

Trp Arg Phe Asn Trp Phe Lys Asn Ser Asp Asn Pro Thr Met Thr Phe 

285 290 295 

Lys GIu Val Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly Cys Glu 

300 305 310 

Arg Lys 
31S 

<210> 8 
<211> 1017 
<212> DNA 

<213> Mucor circlnelloides CP99001 
<220> 

<221> sigUpeptide 
<222> (I)... (66) 

<220> 

<221> matSpeptide 
<222> (67)... (1017) 

<400> 8 

atg aag ttc acc gtl get att acl tea ate get gtt gca etc get etc 

Mel Lys Phe Thr Val A|a He Thr Ser lie Ala Val Ala Leu Ala Leu 
-20 -15 -10 
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age let tct get gaa get get tcl tgc age tct gtc tat ggt caa tgt 
Ser Ser Ser Ala GJu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gin Cys 
-5 15 10 

ggt gge att gga tgg agt gga cot acc tgt tgt gaa agt ggc tct act 
Gly Gly lie Gly Trp Ser Gly Pro Thr Cys Cys Glu Ser Gly Ser Thr 

15 20 . 25 

tge gtt get caa gaa ggc aac aaa tac tac tct caa tgt ctt ccc gga 
Cys Val Ala Gin Glu Gly Asn Lys Tyr Tyr Ser Gin Cys Leu Pro Gly 

30 35 40 

tec cac agt aac aat get ggt aac gel age age acc aag aag aca tct 
Ser His Ser Asn Asn Ala Gly Asn Ala Ser Ser Thr Lys Lys Thr Ser 

45 50 55 

acc aag aca tct act acc acc gee aag get act get act gtc acc acc 
Thr Lys Thr Ser Thr Thr Thr Ala Lys Ala Thr Ala Thr Val Thr Thr 

60 65 70 

aag aca gta acc aag aca act acc aag aca act acc aag act age act 
Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Thr Thr Lys Thr Ser Thr 
75 80 85 90 

act gcc get get tcl act tec acc tct tct tct get ggt tac aag gtc 
Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Gly Tyr Lys Val 

95 100 105 

ate tct ggc ggt aaa tct ggc agt ggt tec aca act cgt tat tgg gat 
He Ser Gly Gly Lys Ser Gly Ser Gly Ser Thr Thr Arg Tyr Trp Asp 

110 115 120 

tgt tgt aaa get tcl tgc age tgg eel gga aaa get tct gtc act ggt 
Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Val Thr Gly 

125 130 135 

cct gtt gac acc tgt gee tee aat ggt ate tct tta tta gat gee aat 528 
Pro Val Asp Thr Cys Ala Ser Asn Gly He Ser Leu Leu Asp Ala Asn 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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140 14S 
get caa agt ggt tgt aac ggt 
Ala Gin Ser Gly Cys Asn Gly 
I5S 160 
caa cct tgg get gtc aat gat 
Gin Pro Trp Ala Val Asn Asp 
175 

tci att get ggc tec aac gaa 
Ser He Ala Gly Ser Asn GIu 
190 

ttg acc itc act tct ggc get 
Leu Thr Phe Thr Ser Gly Ala 
205 

gtt acc aac acc ggt ggc gat 
Val Thr Asn Thr Gly fily Asp 
220 225 
atg ccc ggt ggt ggc gtt ggt 
Met Pro Gly Gly Gly Val Gly 
235 . 240 

ggc get ccc aat gat ggc tgg 
Gly Ala Pro Asn Asp Gly Trp 
255 

gtc tct gac tgt gee tct ctt 
Val Ser Asp Cys Ala Ser Leu 
270 

tgg aga ttc aac tgg ttc aag 
Trp Arg Phe Asn Trp Phe Lys 
285 

aag gaa gtt acc tgt cct get 



150 



ggt aat ggt ttc atg tgt aac aac aac 576 
Gly Asn Gly Phe Met Cys Asn Asn Aso 
165 170 
gag etc get tac ggt ttc gel get gee 624 
Glu Leu Ala Tyr Gly Phe Ala Ala Ala 

180 185 
get gga tgg tgt tgt ggc tgt tat gaa 672 
Ala Gly Trp Cys Cys Gly Cys Tyr Gtu 

195 200 
get tct gga aag aag atg gtt gtt caa 720 
Ala Ser Gly Lys Lys Met Val Val Gin 
210 215 
tta ggc tct aac cac ttt gat ttg caa 768 
Leu Gly Ser Asn His Phe Asp Leu Gin 
230 

ate ttc aat ggc tgt get get caa tgg 816 
He Phe Asn Gly Cys Ala Ala Gin Trp 
245 250 
gga get aga tat ggt ggt gtc age tct 864 
Gly Ala Arg Tyr Gly Gly Val Ser Ser 

260 265 
ccc tct get ctt caa gel ggt tgt aaa 912 
Pro Ser Ala Leu Gin Ala Gly Cys Lys 

275 280 
aac tct gat aac cct acc atg acc ttc 960 
Asn Ser Asp Asn Pro Thr Met Thr Phe 
290 295 
gaa tta act act cgc tea ggt tge gaa 1008 
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lys Glu Val Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly Cys G] 

300 305 310 

aga aag taa 
Arg Lys 
315 



<210> 9 
<21I> 387 
<212> PRT 

<213> Mucor circineJloides CF99001 



<220> 

<221> sig#peptide 
<222> (-22)... (-1) 



<220> 

<221> matSpeptlde 
<222> (1) . . . (365) 



<400> 9 

Mel Lys Phe Thr Val Ala He Thr Ser lie Ala Val Ala Leu Ala Leu 

-20 -15 _,o 

Ser Ser Ser Ala Glu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gin Cys 

' 5 ,0 

Gly Gly He Gly Trp Thr Gly Pro Thr Cys Cys Asp Ala Gly Ser Thr 

'5 • 20 25 

Cys Lys Ala Gin Lys Asp Asn Lys Tyr Tyr Ser Gin Cys He Pro Lys 

30 35 40 

Pro Lys Gly Ser Ser Ser Ser Ser Ser Cys Ser Ser Val Tyr Ser Gin 
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45 50 55 

Cys Gly Gly He Gly Trp Ser Gly Pro Thr Cys Cys GIu Ser Gly Ser 

60 65 70 

Thr Cys Val Ala Gin Glu Giy Asn Lys Tyr Tyr Ser Gin Cys leu Pro 
75 80 85 90 

Gly Ser His Ser Asn Asn Ala Gly Asn Ala Ser Ser Thr Lys Lys Thr 

95 100 105 

Ser Thr Lys Thr Ser Thr Thr Thr Ala Lys Ala Thr Ala Thr Val Thr 

no 115 120 

Thr Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Ihr Thr Lys Thr Ser 

125 130 135 

Thr Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Gly Tyr Lys 

140 145 ISO 

Val He Ser Gly Gly Lys Ser Gly Ser Gly Ser Thr Thr Arg Tyr Trp 
155 160 165 170 

Asp Cys Cys lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Val Thr 

175 180 185 

Gly Pro Val Asp Thr Cys Ala Ser Asn Gly He Ser Leu Leu Asp Ala 

190 195 200 

Asn Ala Gin Ser Gly Cys Asn Gly Gly Asn Gly Phe Met Cys Asn Asn 

205 210 215 

Asn Gin Pro Trp Ala Val Asn Asp Glu Leu Ala Tyr Gly Phe Ala Ala 

220 225 230 

Ala Ser He Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Gly Cys Tyr 
235 240 245 250 

Glu Leu Thr Phe Thr Ser Gly Ala Ala Ser Gly Lys Lys Met Val Val 

255 260 265 

Gin Val Thr Asn Thr Cly Gly Asp Leu Gly Ser Asn His Phe Asp Leu 
270 275 280 
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Gin Met Pro Gly Gly Cly Val Gly He Phe Asn Gly Cys Ala Ala Gin 

285 290 295 

Trp Gly Ala Pro Aso Asp Gly Trp Gly Ala Arg Tyr Cly Gly Val Ser 

300 305 310 

Ser Val Ser Asp Cys Ala Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys 
315 320 325 330 

Lys Trp Arg Phe Asn Trp Phe Lys Asn Ser Asp Asn Pro Thr Met Thr 

335 340 345 

Phe Lys Glu Yal Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly Cys 
350 355 360 

Glu Arg Lys 
365 

<210> 10 
<211> 1164 
<212> DNA 

<213> Mucor circinelloides CP99001 
<220> 

<22]> sigfpeptide 
<222> {!)... (66) 

<220> 

<221> matfpeptide 
<222> (67)... (1164) 

<400> 10 

atg aag ttc acc gtt get att act tea ate get gtt gca etc get etc 
Met Lys Phe Thr Val Ala He Thr Ser He Ala Val Ala Leu Ala Leu 
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-20 -15 -10 

age tct let get gaa get get let tge age tct gtc tat ggt caa tgt 96 
Ser Ser Ser Ala Glu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gin Cys 
-5 15 10 

ggt ggc att ggc tgg act ggt cct aca tgt tgt gat get gga teg acc 144 
Gly Gly He Giy Trp Thr Gly Pro Thr Cys Cys Asp Ala Gly Ser Tlir 

IS 20 25 

tgt aaa get caa aag gat aac aaa tat tat tct caa tgt att ccc aaa 192 
Cys Lys Ala Gin Lys Asp Asn Lys Tyr Tyr Ser Gin Cys lie Pro Lys 

30 35 40 

ccc aag ggt tec tec tea tea tea tea tgt agt tec gtc tat agt caa 240 
Pro Lys Gly Ser Ser Ser Ser Ser Ser Cys Ser Ser Val Tyr Ser Gin 

45 50 55 

tgc ggt ggc att gga tgg agt gga cct acc tgt tgt gaa agt ggc tct 288 
Cys Gly Gly He Gly Trp Ser Gly Pro Thr Cys Cys Glu Ser Gly Ser 

60 65 70 

act tgc gtt get caa gaa ggc aac aaa tac tac tct caa tgt ctt ccc 336 
Thr Cys Val Ala Gin Glu Gly Asn Lys Tyr Tyr Ser Gin Cys Leu Pro 
75 80 85 90 

gga tec cac agt aac aat get ggt aac get age age acc aag aag aca 384 
Gly Ser His Ser Asn Asn Ala Gly Asa Ala Ser Ser Thr Lys Lys Thr 

95 100 105 

tct ace aag aca tct act acc acc gee aag gel act get act gtc acc 432 
Ser Thr Lys Thr Ser Thr Thr Thr Ala Lys Ala Thr Ala Thr Val Thr 

no 115 120 

ace aag aca gta ace aag aca act ace aag aca act acc aag act age 480 
Thr Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Thr Thr Lys Thr Ser 

125 130 135 

act act gcc get get tct act tec acc tct tct tct get ggt tac aag 528 
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Thr Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Cly Tyr Lys 

140 145 150 

gtc ate tct ggc ggt aaa let ggc agt ggl tec aca act cgt tat Igg 576 
Val He Ser Gly Gly Lys Ser Cly Ser Gly Ser Thr Thr Arg Tyr Trp 
155 160 165 170 

gat tgt tgt aaa get tct tgc age tgg cct gga aaa get tct gtc act 6Z4 
Asp Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Yal Thr 

175 180 185 

ggt cct gtt gac aec tgt gcc tec aat ggt ate tct tta tta gat gee 672 
Gly Pro Val Asp Thr Cys Ala Ser Asn Gly He Ser Leu Leu Asp Ala 

190 195 200 

aat get caa agt ggt tgt aac ggt ggt aat ggt ttc atg tgt aac aac 720 
Asn Ala Gin Ser Gly Cys Asn Gly Gly Asn Gly Phe Met Cys Asn Asn 

205 210 215 

aac caa cct tgg get gtc aat gat gag etc get tac ggt ttc get get 768 
Asn Gin Pro Trp Ala Val Asn Asp Glu Leu Ala Tyr Gly Phe Ala Ala 

220 225 230 

gcc tct att get ggc tec aac gaa get gga tgg tgt tgt ggc tgt tat 816 
Ala Ser He Ala Cly Ser Asn Glu Ala Gly Trp Cys Cys Gly Cys Tyr 
235 240 245 250 

gaa ttg ace ttc act tct ggc get get tct gga aag aag atg gtt gtt 864 
Glu Leu Thr Phe Thr Ser Gly Ala Ala Ser Gly Lys Lys Met Yal Val 

255 260 265 

caa gtt ace aac acc ggt ggc gat tta ggc tct aac cae ttt gat ttg 912 
Gin Val Thr Asn Thr Gly Cly Asp Leu Gly Ser Asn His Phe Asp Leu 

270 275 280 

caa atg cec ggt ggl ggc gtt ggt ate ttc aat ggc tgt get get caa 960 
Gin Met Pro Gly Gly Gly Val Gly He Phe Asn Gly Cvs Ala Ala Gin 
285 290 295 
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tgg ggc get ccc aat gat ggc tgg gga get aga tat ggt ggt gtc age 1008 
Trp Gly Ala Pro Asn Asp Gly Trp Gly Ala Arg Tyr Gly Gly Val Ser 

300 305 310 

tct gtc tct gac tgt gcc tot ctt ccc tct get ctt caa get ggt tgt 1056 
Ser Val Ser Asp Cys Ala Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys 
315 320 325 330 

aaa tgg aga ttc aac tgg ttc aag aac tct gat aac cot ace atg ace 1104 
Lys Trp Arg Phe Asn Trp Phe Lys Asn Ser Asp Asn Pro Thr Met Thr 

335 340 345 

ttc aag gaa gtt acc tgt cot get gaa tta act act cgc tea ggt tgc 1152 
Phe Lys Glu Val Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly Cys 

350 355 360 

gaa aga aag taa 1164 
Glu Arg Lys 
365 

<210> 11 
<211> 346 
<212> PRT 

<213> Phycomyces nitens CP99002 

<220> 

<221> sigtpeptide 
<222> (-19)... (-1) 

<220> 

<221> nattpeptide 
<222> (D... (327) 
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<400> 11 

Met Lys Phe Ser He He Ala Ser Ala Leu Leu Leu Ala Ala Ser Ser 

-15 -JO -5 

Thr Tyr Ala Ala Glu Cys Ser Gin Gly Tyr Gly Gin Cys Gly Gly Lys 

1 5 10 

Met Trp Thr Gly Pro Thr Cys Cys Thr Ser Gly Phe Thr Cys Val Gly 

15 20 25 

Ala Glu Asn Asn Glu Trp Tyr Ser Gin Cys He Pro Asn Asp Gin Val 
30 35 40 45 

GiD Gly Asn Pro Lys Thr Thr Thr Thr Thr Thr Thr Lys Ala Ala Thr 

50 55 60 

Thr Thr Lys Ala Pro Val Thr Thr Thr Lys Ala Thr Thr Thr Thr Thr 

65 70 75 

Thr Lys Ala Pro Val Thr Thr Thr Lys Ala Thr Thr Thr Thr Thr Thr 

80 85 90 

Lys Thr Thr Thr Lys Thr Thr Thr Thr Lys Ala Ala Thr Thr Thr Ser 

95 100 IDS 

Ser Ser Asn Thr Gly Tyr Ser Pro He Ser Gly Gly Phe Ser Gly Asn 
110 115 120 125 

Gly Arg Thr Thr Arg Tyr Trp Asp Cys Cys Lys Pro Ser Cys Ala Trp 

130 135 140 

Asp Gly Lys Ala Ser Val Thr Lys Pro Val Leu Thr Cys Ala Lys Asp 

145 150 155 

Gly Val Ser Arg Leu Gly Ser Asp Val Gin Ser Gly Cys Val Gly Gly 

160 165 170 

Gin Ala Tyr Met Cys Asn Asp Asn Gin Pro Trp Val Val Asn Asp Asp 

175 180 185 

Leu Ala Tyr Gly Phe Ala Ala Ala Ser Leu Gly Ser Ala Gly Ala Ser 
190 195 200 205 
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Ala Phe Cys Cys Gly Cys Tyr Glu Leu Thr Phe Thr Asn Thr Ala Val 

210 215 220 

Ala Gly Lys Lys Phe Val Val Gin Val Thr Asn Thr Gly Asp Asp Leu 

225 230 235 

Ser Thr Asn His Phe Asp Leu Gin Met Pro Gly Gly Gly Val Gly Tyr 

240 245 250 

Phe Asn Gly Cys Gin Ser Gin Trp Asn Thr Asn Thr Asp Gly Trp Gly 

255 260 265 

Ala Arg Tyr Gly Gly He Ser Ser He Ser Glu Cys Asp Lys Leu Pro 
270 275 280 285 

Thr Gin Leu Gin Ala Gly Cys Lys Trp Arg Phe Gly Trp Phe Lys Asn 

290 295 300 

Ala Asp Asn Pro Glu Val Thr Phe Lys Ala Val Thr Cys Pro Ala Glu 

305 310 315 

lie lie Ala Lys Thr Gly Cys Glu Arg Lys 
320 325 



<2I0> 12 
<211> 1041 
<212> DNA 

<213> Pbyconyces nitens CP99002 
<220> 

<221> siglpeptide 
<222> ID... (57) 

<220> 

<221> fliatfpeptide 
<222) (58)... (1041) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



<400> 12 
atg aag ttc 
Met Lys Pbe 

act tac get 
Thr lyr Ala 

atg tgg act 
Met Trp Thr 

15 

gcc gaa aac 
Ala Glu Asn 
30 

cag ggt aac 
Gin Gly Asn 

acc acc aag 
Thr Thr Lys 

acc aag gcc 
Thr Lys Ala 
80 

aag acc acc 
Lys Thr Thr 
95 

tct tec aac 
Ser Ser Asn 
110 



tec ate 
Ser lie 
-15 
get gaa 
Ala Glu 
1 

ggt ccc 
Gly Pro 

aac gag 
Asn Glu 

ccc aag 
Pro Lys 
50 

get cct 
Ala Pro 
65 

cct gtc 
Pro Val 

acc aag 
Thr Lys 

act ggc 
Thr Gly 



ate get tec 
He Ala Ser 

tgc age caa 
Cys Ser Gin 
5 

acc tgc tgc 
Thr Cys Cys 
20 

tgg tac tct 
Trp Tyr Ser 
35 

acc acc acc 
Thr Thr Thr 

gtc acc acc 
Val Thr Thr 

acc acc acc 
Thr Thr Thr 
85 

acc acc acc 
Thr Thr Thr 

100 
tac age ccc 
Tyr Ser Pro 

lis 



gcc ctt 
Ala Leu 
-10 
ggc tat 
Gly Tyr 

acc tec 
Thr Ser 

cag Igt 
Gin Cys 

acc ace 
Thr Thr 
55 

acc aag 
Thr Lys 
70 

aag gcc 
Lys Ala 

acc aag 
Thr Lys 

att tct 
He Ser 



etc etc 
Leu Leu 

ggc cag 
Gly Gin 

ggc ttc 
Gly Phe 
25 

ate ccc 
lie Pro 
40 

acc acc 
Thr Thr 

gcc acc 
Ala Thr 

act act 
Thr Thr 

get gcc 
Ala Ala 
105 
ggt ggc 
Gly Gly 
120 



get gcc age tec 
Ala Ala Ser Ser 

-5 

tgt ggt ggc aag 
Cys Gly Gly Lys 
10 

acc tgt gta ggt 
Thr Cys Val Gly 



aac gat 
Asn Asp 

aag get 
Lys Ala 

acc acc 
Thr Thr 
75 

act acc 
Thr Thr 
90 

acc ace 
Thr Thr 



caa gtc 
Gin Val 
45 

gcc act 
Ala Thr 
60 

acc acc 
Thr Thr 



48 



96 



144 



192 



240 



288 



acc acc 336 
Thr Thr 

acc tec 384 
Thr Ser 



ttc tct gga aac 
Phe Ser Gly Asn 
125 



432 
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ggt cgc act acc cgc tac tgg gat tgc tgc aag ccc tct tgc gcc Igg 480 
Gly Arg Thr Thr Arg Tyr Trp Asp Cys Cys Lys Pro Ser Cys Ala Trp 

130 135 140 

gac gga aag get tct gta act aag cct gta etc acc tgt gcc aag gat 528 
Asp Gly Lys Ala Ser Yal Thr Lys Pro Yal Leu Thr Cys Ala Lys Asp 

145 ISO 155 

ggt gtc age cgt etc ggt tec gat gtc cag age ggt tgc gtc ggc ggc 576 
Gly Yal Ser Arg Leu Gly Ser Asp Yal Gin Ser Gly Cys Yal Gly Gly 

'160 165 170 

cag gcc tac atg tgc aat gac aac cag ccc tgg gtt gtc aat gac gac 624 
Gin Ala Tyr Met Cys Asn Asp Asn Gin Pro Trp Val Yal Asn Asp Asp 

175 180 18S 

ctt gcc tac ggt ttc get get gcc agt etc ggt age gee ggt gcc tct 672 
Leu Ala Tyr Gly Pbe Ala Ala Ala Ser Leu Gly Ser Ala Gly Ala Ser 
190 195 200 205 

gea ttc tgc tgc ggc tgt tac gag ctt acc ttc acc aac act get gtc 720 
Ala Phe Cys Cys Gly Cys Tyr Glu Leu Thr Phe Thr Asn Thr Ala Yal 

210 215 220 

get ggc aag aag ttt gtc gtc cag gtc acc aac acc ggt gat gat etc 768 
Ala Gly Lys Lys Phe Yal Val Gin Yal Thr Asn Thr Gly Asp Asp Leu 

225 230 235 

age ace aac eac ttt gat ttg cag atg cec ggc ggt ggt gtc ggc tac 816 
Ser Thr Asn His Phe Asp Leu Gin Met Pro Gly Gly Gly Yal Gly Tyr 

240 245 250 

ttc aac ggc tgc cag tec cag tgg aac acc aac acc gat ggc tgg ggt 864 
Phe Asn Gly Cys Gin Ser Gin Trp Asn Thr Asn Thr Asp Gly Trp Gly 

255 260 265 

get cgc tat ggc ggt att age tct att tea gag tgc gac aag ctt cct 912 
Ala Arg Tyr Gly Gly lie Ser Ser lie Ser Glu Cys Asp Lys Leu Pro 
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270 275 280 285 

^ acc cag ttg cag get ggt tgc aag tgg aga ttc gga tgg ttc aag aac 960 

Thr Gin leu Gin Ala Gly Cys Lys Trp Arg Phe Gly Trp Phe Lys Asn 
290 295 300 

10 get gac aac cca gag gtc acc ttc aag get gtt act tgc cct gcc gag 1008 
Ala Asp Asn Pro GIu Val Thr Phe Lys Ala Val Thr Cys Pro Ala Glu 
305 310 315 

" ate att gcc aag act ggt tgc gag cgc aag taa 1041 
lie He Ala Lys Thr Gly Cys Glu Arg Lys 
320 325 

20 

<210> 13 
<211> 1043 

25 

<212> DNA 

<213> Artificial SeQuence 

30 

<220> 

<223> Codon-optiuiized sequence corresponding to RCE I 
35 protein {SEQ. ID NO: 2) 

<220> 

<221> sigSpeptide 
<222> (16)... (84) 

<220> 

<221> mattfpeptide 
so <222> (84)... (1043) 

<400> 13 
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ggatcctggg acaag atg aag ttc ate act ate gcc tec tec gee etc ctt 51 
Uet Lys Phe He Thr He Ala Ser Ser Ala Leu Leu 
-20 -15 
gcc etc gcc ctt ggc act gag atg gcc tec gcc get gag tgc tec aag 99 
Ala Leu Ala Leu Gly Thr Glu Met Ala Ser Ala Ala Glu Cys Ser Lys 
-10 -5 1 5 

etc tac gga cag tgc ggc gga aag aac tgg aac ggc ccc acc tgc tgc 147 
Leu Tyr Gly Gin Cys Gly Gly Lys Asn Trp Asn Gly Pro Thr Cys Cys 

10 15 20 

gag age ggc teg acc tgc aag gtc teg aat gac tac tac age eag tgc 195 
Glu Ser Gly Ser Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys 

25 30 35 

ctg ccg age ggc tec teg gga aac aag teg age gag teg gee eac aag 243 
Leu Pro Ser Gly Ser Ser Gly Asn Lys Ser Ser Glu Ser Ala His Lys 

40 45 50 

aag acc acg acc get gee eac aag aag ace acg acc gcc get cac aag 29] 
Lys Thr Thr Thr Ala Ala His Lys Lys Thr Thr Thr Ala Ala His Lys 

55 60 65 

aag act acg acc get ccc gcc aag aag ace acg acc gtc gcc aag get 339 
Lys Thr Thr Thr Ala Pro Ala Lys Lys Thr Thr Thr Val Ala Lys Ala 
70 75 80 85 

leg act ccg tec aac teg age age teg tct teg gga aag tac age get 387 
Ser Thr Pro Ser Asn Ser Ser Ser Ser Ser Ser Gly Lys Tyr Ser Ala 

90 95 100 

gtc age ggt ggc get age ggc aac ggc gtc act acc cgc tac tgg gae 435 
Val Ser Gly Gly Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp 

105 110 lis 

tgc tgc aag get teg tgc teg tgg ccc ggc aag get aac gtc age teg 483 
Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser 
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120 125 130 

cct gtc aag Icc tgc aac aag gac ggc gtc acc get ctl age gac tec 531 
Pro Vai Lys Ser Cys Asn Lys Asp Gly Val Thr Ala Leu Ser Asp Ser 

135 140 145 

aac gcc cag tec ggc tge aac ggc ggc aac tec tac atg tgc aac gac 579 
Asn Ala Gin Ser Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp . 
150 155 160 165 

aac cag cca tgg get gtc aac gac aac ctt get tac ggt tte get gcc 627 
Asn Gin Pro Trp Ala Val Asn Asp Asa Leu Ala Tyr Gly Phe Ala Ala 

170 175 180 

get gcc att age ggc ggt ggc gag age cgc tgg tgc tgc tec tgc tte 675 
Ala Ala He Ser Gly Gly Gly GIu Ser Arg Trp Cys Cys Ser Cys Phe 

185 190 195 

gag etc acc tte acc tec acc age gtt get ggc aag aag atg gtc gtc 723 
Glu Leu Thr Phe Thr Ser Thr Ser Yal Ala Gly Lys Lys Met Val Val 

200 205 210 

cag gtc acc aac act ggc ggt gac ctt ggc age teg acc ggt gcc eac 771 
Gin Val Thr Asn Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His 

215 220 225 

tte gat etc cag atg ccc ggc ggc ggc gtc ggc ate tte aac gga tgc 819 
Phe Asp Leu Gin Met Pro Gly Gly Gly Val Gly He Phe Asn Gly Cys 
230 235 240 245 

teg tec cag tgg ggc get ccc aac gac ggc tgg ggc teg cgc tac ggc 887 
Ser Ser Gin Trp Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly 

250 255 260 

ggc ate age tec gcc age gac tge teg tec etc ccc age gcc etc cag 915 
Gly He Ser Ser Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gin 

265 270 275 

gcc ggc tgc aag tgg cgc tte aac tgg itc aag aac gee gac aac ccg 963 
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Ala Gly Cys Lys Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Asn Pro 
280 285 290 

s 

tec atg acc tac aag gag gtc acc tgc ccc aag gag ate acc get aag 1011 

Ser Met Thr Tyr Lys Glu Val Thr Cys Pro Lys Glu lie Thr Ala Lys 
10 295 300 305 

acc gga tgc teg cge aag taa acgcagg atec ]043 

Thr Gly Cys Ser Arg Lys 
" 310 315 

<210> 14 
<211> 40 
<212> PRT 

<213> Rblzopus oryzae CP96001 

25 

<m> 14 

go Ala Glu Cys Ser Lys Leu Tyr Gly Gin Cys Gly Gly Lys Asn Trp Asn 
15 10 15 

Gly Pro Thr Cys Cys Glu Ser Gly Ser Thr Cys Lys Val Ser Asn Asp 
35 20 25 30 

Tyr Tyr Ser Gin Cys Leu Pro Ser 
35 40 

40 

<2]0> 15 
<211> 22 
<212> PRT 

<213> Mucor circinelloides CP99001 

so 

<400> 15 

Ala Ser Cys Ser Ser Val Tyr Gly Gin Cys Gly Gly He Gly Trp Ser 
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I 5 10 15 

Gly Pro Thr Cys Cys Glu 
20 

<210> 16 
<211> 23 
<212> PRT 

<213> Phycooyces nitens CP99002 
<4flO> 16 

Ala Glu Cys Ser Gin Gly Tyr Gly Gin Cys Gly Gly Lys Met Trp Thr 

1 5 10 15 

Gly Pro Thr Cys Cys Thr Ser 
20 

<210> 17 
<211> 39 
<212> PUT 

<213> Artificial Sequence 
<220> 

<223> Consensus aiino acid sequence found in cellulose binding 
domains of the laniily 45 endoglucanases 

<220> 
<221> Xaa 

<222> (1).. (6). (11).. (14). (16).. (18). (20).. (25), (27).. (31). 

(33).. (34), (36), (39) 
<223> Xaa is one of all amino acids 
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<300> 

<301> Hoffren. A. -M. et al. 
<303> Protein EngioeeriDg 
<304> 8 
<306> 443-450 
<307> 1995 

<400> 17 

Xaa Xaa Xaa Xaa Xaa Xaa Gin Cys Cly Gly Xaa Xaa Xaa Xaa Gly Xaa 

] S 10 IS 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Asn 

20 25 30 

Xaa Xaa Tyr Xaa Gin Cys Xaa 
35 

<210> 18 
<211> 1257 
<212> DM 

<2]3> Humlcola insolens 
<220> 

<221> in Iron 
<222> (453).. (509) 

<400> 18 

aatgacgggg caacctcccg cccgggccca actcttgggt ttggtttgac aggccgtctg 60 
tctcttgcgt cctcttacta cgcctgcctg gaccctacgt ctcaactccg attcaagatg 120 
cgtlcctccc ctctcctccg ctccgccgtt gtggccgccc tgccggtgtt ggcccttgcc 180 
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gctgatggca agtccacccg clactgggac Igctgcaagc cttcglgcgg clgggccaag 240 
aaggctcccg tgaaccagcc tgtcttctcc tgcaacgcca acttccagcg tctcactgac 300 
ttcgacgcca agtccggctg cgagccgggc ggtgtcgcct actcgtgcgc cgaccagacc 360 
ccatgggctg tgaacgacga cttcgcgttc ggttttgctg ccacctctat tgccggcagc 420 
aatgaggcgg gctggtgctg cgcctgctac gagtaagctt tggtcgcgtg tgtaacactg 480 
tgcaggcata gcactaacca cctcccaggc tcaccttcac atccggtcct gttgctggca 540 
agaagatggt cgtccagtcc accagcactg gcggtgatct tggcagcaac cacttcgatc 600 
tcaacatccc cggcggcggc gtcggcatct tcgacggatg cactccccag ttcggcggtc 660 
tgcccggcca gcgctacggc ggcatctcgt cccgcaacga gtgcgatcgg ttccccgacg 720 
ccctcaagcc cggctgctac tggcgcttcg actggttcaa gaacgccgac aacccgagct 780 
tcagcttccg tcaggtccaa tgcccagccg agctcgtcgc tcgcaccgga tgccgccgca 840 
acgacgacgg caacttccct gccgtccaga tcccctccag cagcaccagc tctccggtcg 900 
gccagcctac cagtaccagc accacctcca cctccaccac ctcgagcccg cccgtccagc 960 
ctacgactcc cagcggctgc actgctgaga ggtgggctca gtgcggcggc aatggctgga 1020 
gcggctgcac cacctgcgtc gctggcagca cctgcacgaa gattaatgac tggtaccatc 1080 
agtgcctgta aacgcagggc agcctgagaa ccttactggt tgcgcaacga aatgacactc 1140 
ccaatcactg tattagttct tgtacataat ttcgtcatcc ctccagggat tgtcacatat 1200 
atgcaatgat gaatactgaa cacaaacctg gccgcttgaa ctggccgaag gaatgcc 1257 

<210> 19 

<Z11> 1720 

<Z12> DNA 

<213> Trichoderina virlde 
<220> 

<221> intron 

<222> (500).. (682) 

<400> 19 
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ggtgtgtcat ttctcctcaa catactgcct 
atatcccaat atcagttttt cccaaagtag 
ttaatcagtg cccatagtgt ctgtctgttg 
ccatggctcc attgctgctt gcagcgtcga 
ctgtctgggg acagtgtgga ggtattggtt 
cagcttgttc tactctcaat ccttattatg 
ccacctcgac ccgacccccc tcgggtccaa 
catctccgcc accgaccagc tctggagttc 
acttcggatg taccacagag tatgtcttca 
gggcggatga tttatagctg tgcgggctgc 
atagttgata ggcggcggtg ttttcttcaa 
gctgatggat gtatgattac agtggcactt 
agaacltcac tggggcaaac aactacccgg 
acgatgatgg gatgactatt ttccgcctac 
atctgggtgg aactctcgat tccaccagta 
gcctgtctct cggtgtatac tgcatcatcg 
gaatcattgg ccagggaggc cctacaaatg 
catcgaagta cgcgtctcag tcgagggtgt 
tgaacatcaa cacttgggct gccacggttc 
gtgclacgtc gcaatacatt tctctgcctg 
ttlccgatgg cagtgcagcc gccctgtctc 
atctaatctt cgatgtccac aagtacttag 
gcactacaaa caacatcgac ggcgcctttg 
accgccaggc t'attctgacg gaaaccggcg 
tgtgccaaca gatccagtac ctcaaccaga 
ggggtgccgg tlcatttgat agcacttata 
actcgtggac ggacacatcc ctagtlagct 
gattgcagga gccttgtcaa tagcgatttc 
agccgcttgt tctgggtatg Iglcttgaca 



ttcaacaact ttcgcctcct ccctggcctg 60 
caagtcatca gtaaatctgc tcatctatca 120 
attgcctccc gccatacacg atgaacagga 180 
tactcttcgg gggcgctgct gcacaacaga 240 
ggagcggacc tacgagttgt gctcctggat 300 
cgcaatgcat tccgggggcc actagtatca 360 
ccaccaccac cagagccacc tcaacgacct 420 
gatttgctgg cgttaacatc gcgggctttg 480 
tgttgcatag tgttgctggc tgagtattct 540 
aaaacaccgc cggtctgcca ctatcaaggc 600 
tcccctgatt acactctcaa gaatctagtg 660 
gcgttacatc gaaggtttat cctccgttga 720 
acggtatcgg ccagatgcag cacttcgtca 780 
ccgtcggatg gcagtacctc gtaaacaaca 840 
tctcgaagta tgatcagctc gttcaggggt 900 
acatccacaa ttatgctcga tggaacggtg 960 
cccagtttac cagtctttgg tcgcagttgg 1020 
ggttcggaat aatgaalgag ccccacgacg 1080 
aagaggtcgt cactgcaatc cgcaacgccg 1140 
gaaatgatta tcaatctgcg gcagctttta 1200 
aggtaacgaa ccctgatgga tcaacaacga 1260 
actcggacaa ctccggtact cacgccgaat 1320 
ctcclctcgc cacttggctt cgacagaaca 1380 
gtggcaatgt tcagtcctgc atccaagatt 1440 
actcagatgt ctatcttggc tatgctggct 1500 
ttctgacgga aacgcctact ggaagcggta 1560 
cgtglctcgc caggaagtaa caccgaggtc 1620 
atcttgctgt acataattct lactctctga 1680 
ggttlctaga 1720 
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<210> 20 
» <211> 20 
<212> DNA 

<213> Artificial Seauence 

10 

<220> 

15 <223> Description of Artificial Sequence: Primer 
<400> 20 

20 

caccacgcgc tactgggact 20 

^ <210> 21 

<211> 19 

<212> DNA 
30 <213> Artificial Sequence 

<220> 

^ <223> Description of Artificial SequencerPrimer 
<400> 21 

ggatcctgcg tttacttgc ]g 

^ <210> 22 
<211> 18 
<212> DNA 

so 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial SeauencerPriner 

<400> 22 

ggatcctggg acaagatg 

<210> 23 
<2I1> J7 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SequencerPrioer 
<400> 23 

gcacgacggc ttgcagc 

<210> 24 
<211> 228 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence :RecoiDbi nan t 
protein 

<400> 24 

liiet Gin Leu Pro Leu Thr Thr Leu Leu Tlir Leu Leu Pro Ala Leu Al 
15 10 15 
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Ala Ala Gin Ser Gly Ser Gly Arg Thr Thr Arg Tyr Trp Asp Cys Cys 
20 25 30 

Lys Pro Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser Pro Val 
35 40 45 

Lys Ser Cys Asn Lys Asp Gly Val Thr Ala Leu Ser Asp Ser Asn Ala 
50 55 60 

Gin Ser Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp Asn Gin 
65 70 75 80 

Pro Trp Ala Val Asn Asp Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala 
85 90 95 

He Ser Gly Gly Gly Glu Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu 
100 105 110 

Thr Phe Thr Ser Thr Ser Val Ala Gly Lys Lys Met Val Val Gin Val 
115 120 125 

Thr Asn Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His Phe Asp 
130 135 140 

Leu Gin Met Pro Gly Gly Gly Val Gly lie Phe Asn Gly Cys Ser Ser 
145 150 155 160 

Gin Trp Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly Gly He 
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165 170 175 

Ser Ser Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gin Ala Gly 
180 185 190 

Cys Lys Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Asn Pro Ser Met 
195 200 205 

Thr Tyr Lys Glu Val Thr Cys Pro Lys Glu He Thr Ala Lys Thr Gly 
210 215 220 

Cys Ser Arg Lys 
225 



<2J0> 25 
<211> 20 
<2I2> DNA 

,<213> Artificial Seauence 
<220> 

<223> Description of Artificial Seauence .Primer 
<400> 25 

gcggatcctg ggacaagatg 

<210> 26 
<211> 24 
<212> DNA 
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<2]3> Artificial Sequence 

<220> 

<223> Description of Artificial Seauence: Primer 
<400> 26 

gcctgcagag cggcggaggc catc 

<210> 27 
<21I> 25 
<212> DMA 

<213> Artificial Seauence 
<220> 

<223> Description of Artificial Seauence iPrioer 
<400> 27 

gcctgcaggg aaagtacagc gctgt 

<210> 28 
<211> 21 
<212> DNA 

<213> Artificial Seauence 

<220> 

<223> Description of Artificial Seauence .Priner 
<400> 28 

gcggatcclg cgtttacttg c 
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<210> 29 
<2I1> 245 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description oi Artificial SequeDce:RecoBbinant 
protein 

<400> 29 

Met Lys Phe He Thr lie Ala Ser Ser Ala Leu Leu Ala Leu Ala Leu 
15 10 15 

Gly Thr Clu Met Ala Ser Ala Ala Leu Gin Gly Lys Tyr Ser Ala Val 
20 25 30 

Ser Gly Gly Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp Cys 
35 40 45 

Cys Lys Pro Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser Pro 
50 55 60 

Val Lys Ser Cys Asn Lys Asp. Gly Val Thr Ala Leu Ser Asp Ser Asn 
65 70 75 80 

Ala Gin Ser Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp Asn 
85 90 95 
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Gin Pro Trp Ala Val Asn Asp Asn Leu Ala Tyr Gly Phe Ala Ala Ala 
100 . 105 110 

Ala He Ser Gly Gly Gly GIu Ser Arg Trp Cys Cys Ser Cys Phe Glu 
115 120 125 

Leu Thr Phe Thr Ser Ihr Ser Val Ala Gly Lys Lys Met Val Val Gin 
130 135 140 

Val Thr Asn Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His Phe 
145 150 155 160 

Asp Leu Gin Met Pro Gly Gly Gly Val Gly He Phe Asn Gly Cys Ser 
165 170 175 

Ser Gin Trp Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly Gly 
180 185 190 

He Ser Ser Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gin Ala 
195 200 205 

Gly Cys Lys Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Asn Pro Ser 
210 215 220 

Met Thr Tyr Lys Glu Val Thr Cys Pro Lys Glu lie Thr Ala Lys Thr 
225 230 235 240 

Gly Cys Ser Arg Lys 
245 
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<210> 30 
<211> 7 
<212> PRT 

<213> Artificial Seauence 
<220> 

<223> Desciiption oi Artificial Sequence tN-teriainus 
scQuence of recombinant protein 

<400> 30 

Gin Ser Gly Ser Gly Arg Tbr 
1 5 



<210> 31 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SeQuence:N-terminus 
sequence of recombinant protein 

<400> 31 

Lys Tyr Ser Ala Val Ser Gly 
1 5 
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<210> 32 
<211> 7 
<2I2> PRT 

<213> Artificial Seauence 
<220> 

<223> Description of Artificial SeQuencetN-terminus 
seauence of recoDbinant protein 

<400> 32 

Ala Val Ser Gty Gly Ala Ser 
1 5 

<210> 33 
<211> 7 
<212> PRT 

<2I3> Artificial Sequence 
<220> 

<223> Description of Artificial Seauence iN-terminus 
sequence of recombinant protein 

<400> 33 

Ser Ala Val Ser Gly Gly Ala 
1 5 
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<210> 34 
<2]1> 7 
<212> PRT 

<2I3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :N-teriiiiiius 
sequence of reconbinant protein 

<400> 34 

Gly Gly Ala Ser Gly Aso Gly 
1 5 



<210> 35 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SequencerN-terninus 
sequence of recombinant protein 

<400> 35 

Ala Glu Cys Ser Lys Leu Tyr 
1 5 



<210> 36 
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<211> 20 
<212> DNA 

<213> Artificial Seauence 
<220> 

<223> Description of Artificial Seauence -.Primer 
<400> 36 

taytgggayt gytgyaarcc 

<210> 37 
<211> 20 
<212> DNA 

<2]3> Artificial Seauence 
<220> 

<223> Description of Artificial SeQuencerPrimer 
<220> 

<22I> Dodif iedtbase 
<222> (3) 
<223> i 

<220> 

<221> mod ifiedl base 
<222> (9) 
<223> i 

<400> 37 
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tcogcittna rraaccartc 



<210> 38 
<211> 223 
<212> PRT 

<213> Humicola insolens 
<400> 38 

Met Gin Leu Pro Leu Tbr Thr Leu Leu Tbr Leu Leu Pro Ala Leu Ala 
15 10 15 

Ala Ala Gin Ser Gly Ser Gly Ars Thr Thr Arg Tvr Trp Asp Cys Cys 
20 25 30 

Lys Pro Ser Cys Ala Trp Pro Gly Lys Gly Pro Ala Pro Val Arg Thr 
35 40 45 

Cys Asp Arg Trp Asp Asn Pro Leu Phe Asp Gly Gly Asn Thr Arg Ser 
50 55 60 

Gly Cys Asp Ala Gly Gly Gly Ala Tyr Met Cys Ser Asp Gin Ser Pro 
65 70 75 80 

Trp Ala Val Ser Asp Asp Leu Ala Tyr Gly Trp Ala Ala Val Asn He 
85 90 95 

Ala Gly Ser Asn Glu Arg Gin Trp Cys Cys Ala Cys Tyr Glu Leu Thr 
100 105 110 
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Phe Thr Ser Gly Pro Val AJa Gly Lys Arg Met IJe Val Gin Ala Ser 
115 120 125 

Asa Thr Gly Gly Asp Leu Gly Asn Asn. His Phe Asp He Ala Met Pro 
130 135 140 

Gly Gly Gly Val Gly He Pbe Asn Ala Cys Thr Asp Gin Tyr Gly Ala 
145 150 155 160 

Pro Pro Asn Gly Trp Gly Gin Arg Tyr Gly Gly He Ser Gin Arg His 
165 170 175 

Glu Cys Asp Ala Phe Pro Glu Lys Leu Lys Pro Gly Cys Tyr Trp Arg 
180 185 190 

Phe Asp Trp Phe Leu Asn Ala Asp Asn Pro Ser Val Asn Trp Arg Gin 
195 200 205 

Val Ser Cys Pro Ala Glu He Val Ala Lys Ser Gly Cys Ser Arg 
210 215 220 



<210> 39 
<Z11> 672 
<212> DNA 

<213> Hunicola insolens 
<40D> 39 
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atgcagctcc ccctgaccac gctcctcacc ctcctccccg ccctcgcggc ggcccagtcc 60 

ggcagcggcc gcaccacgcg ctactgggac tgctgcaagc cgtcgtgcgc gtggcccggc 120 

aagggcccgg cgcccgtgcg gacgtgcgac cggtgggaca acccgctgtt cgacggcggc 180 

aacacgcgca gcgggtgcga cgcgggcggc ggcgcctaca tgtgctcgga ccagagcccg 240 

tgggcggtca gcgacgacct ggcgtacggc tgggcggccg tcaacattgc cggctccaac 300 

gagaggcagt ggtgctgcgc ctgctacgag ctgaccttca ccagcgggcc ggtggcgggc 360 

aagaggatga ttgtgcaggc gagcaacacg ggaggcgatt tggggaacaa ccactttgat 420 

attgctatgc ccggcggtgg cgtcggtatc ttcaacgcct gcaccgacca gtacggcgcg 480 

ccccccaacg gctggggcca gcgctacggc ggcatcagcc aacgccacga gtgcgacgcc 540 

ttccccgaga agctcaagcc cggctgctac tggcgctttg actggttcct caacgccgac 600 

aacccgagcg tcaactggcg gcaggtcagc tgcccggccg agattgtggc caagagcggc 660 
tgctcgcgtt aa 672 

<210> 40 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SeQuencetPriner 
<400> 40 

ggggatcctg ggacaagatg cagctccccc tgaccacg 3g 

<210> 41 
<211> 38 
<212> DNA 

<Z13> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence .Priner 
<400> 41 

ggggatcctg catttaacgc gagcagccgc tcttggcc 

<210> 42 
<211> 21 
<212> DNA 

<2]3> Artificial Seauence 
<220> 

<223> Description of Artificial Sequence iPrimer 
<400> 42 

gactgctgca agccgtcgtg c 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Seauence 
<220> 

<223> Description of Artificial Seauence :PriiDer 
<400> 43 

gttgcacatg taggagttgc 
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Claims 

1 . A protein that is a Zygomycetes-derived endoglucanase lacking the cellulose-binding domain and exhibits endog- 
lucanase activity. 

5 

2. A protein that is a Zygomycetes-6eme6 endoglucanase belonging to family 45 tacking the cellulose-binding domain 
and exhibits endoglucanase activity. 

3. The protein according to claim 1 or 2, wherein the Zygomycetes are microorganisms selected from the group 
10 consisting of those belonging to Rhizopus, Mucor, and Phycomyces. 

4. The protein according to claim 3, wherein the Zygomycetes are microorganisms belonging to Rhizopus. 

5. A protein comprising an amino acid sequence as shown in SEQ ID NO: 1 , 3, 5, 7, 9, or 11 , wherein the cellulose- 
15 binding domain has been deleted therefrom, and exhibiting endoglucanase activity, a modified protein thereof 

exhibiting endoglucanase activity, or a homologue of the protein or the modified protein exhibiting endoglucanase 

activity. 

6. A protein comprising an amino acid sequence as shown in SEQ ID NO: 1 . 3, 5, 7, 9, or 11 , wherein the celiulose- 
20 binding domain has been deleted therefrom, and exhibiting endoglucanase activity. 

7. A gene encoding the protein, modified protein thereof, or homologue of the protein orthe modified protein according 
to any one of claims 1 to 6. 

25 8. An expression vector comprising the gene according to claim 7. 

9. A host cell transformed with the expression vector according to claim 8. 

10. The host cell according to claim 9, which is a filamentous fungus. 

30 

11. The host cell according to claim 10, which is a microorganism belonging to Humicola. 

12. A method for producing a protein comprising steps of culturing the host cell according to any one of claims 9 to 
11 and collecting from the host cell obtained by the step of culturing or its culture product the protein, modified 

35 protein thereof, or homologue of the protein or the modified protein according to any one of claims 1 to 6. 

13. A protein produced by the method according to claim 12. 

1 4. A celiulase preparation comprising the protein, modified protein thereof, or homologue of the protein orthe modified 
40 protein according to any one of claims 1 to 6 and 13. 

15. A method for treating cellulose-containing fabrics comprising a step of bringing cellulose-containing fabrics into 
contact with the protein, modified protein thereof, or homologue of the protein or the modified protein according 
to any one of claims 1 to 6 and 13 orthe celiulase preparation according to claim 14. 

45 

1 6. A method for reducing the rate at which cellulose-containing fabrics become fuzzy or reducing fuzzing in cellulose- 
containing fabrics comprising a step of bringing cellulose-containing fabrk^s Into contact with the protein, modified 
protein thereof, or homologue of the protein or the modified protein according to any one of claims 1 to 6 and 13 
or the celiulase preparation according to claim 14. 

50 

17. A method of weight loss treatment of cellulose-containing fabrics to improve the feel and appearance thereof 
comprising a step of bringing cellulose-containing fabrics into contact with the protein, modified protein thereof, or 
homologue of the protein or the modified protein according to any one of claims 1 to 6 and 13 or the celiulase 
preparation according to claim 14. 

55 

18. Amethod of color clarification of colored cellulose-containing fabrics comprising a step of treating colored cellulose- 
containing fabrics with the protein, modified protein thereof, or homologue of the protein orthe modified protein 
according to any one of claims 1 to 6 and 13 orthe celiulase preparation according to claim 14. 
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19. A method of providing colored cellulose-containing fabrics with partial color change comprising a step of treating 
colored cellulose-containing fabrics with the protein, modified protein thereof, or homologue of the protein or the 
modified protein according to any one of claims 1 to 6 and 13 or the cellulase preparation according to claim 14. 

5 20. A method for reducing the rate at which cellulose-containing fabrics become stiff or reducing stiffness in cellulose- 
containing fabrics comprising a step of treating cellulose-containing fabrics with the protein, modified protein there- 
of, or homologue of the protein or the modified protein according to any one of claims 1 to 6 and 13 or the cellulase 
preparation according to claim 14. 

10 21. The method according to any one of claims 15 to 20, wherein fabrics are treated through the soaking, washing, 
or rinsing thereof. 

22. An additive to detergent comprising the protein, modified protein thereof, or homologue of the protein or the mod- 
ified protein according to any one of claims 1 to 6 and 13 or the cellulase preparation according to claim 14 in a 

IS non-dusting granular fomi or a stabilized liquid fomi. 

23. A detergent composition comprising the protein, modified protein thereof, or homologue of the protein or the mod- 
ified protein according to any one of claims 1 to 6 and 13 or the cellulase preparation according to claim 14. 

20 24. A method of deinking waste paper using a deinking agent wherein the protein, modified protein thereof, or homo- 
logue of the protein or the modified protein according to any one of claims 1 to 6 and 1 3 or the cellulase preparation 
according to claim 14 is used in a step of deinking waste paper. 

25. A method for Improving drainage of paper pulp comprising a step of treating paper pulp with the protein, modified 
25 protein thereof, or homologue of the protein or the modified protein according to any one of claims 1 to 6 and 13 

or the cellulase preparation according to claim 14. 

26. A method for improving digestibility of animal feeds comprising a step of treating animal feeds with the protein, 
modified protein thereof, or homologue of the protein or the modified protein according to any one of claims 1 to 

30 6 and 1 3 or the cellulase preparation according to claim 1 4. 
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